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5 Nucl ic Acid Encoding 

Human Neuronal Calcium Channel Subunits 

Background of the Invention 

10 Calcium channels are multi-subuni t protein 

complexes that span the plasma membrane and are 
involved in the movement of calcium ions into the 
cell. Voltage-dependent calcium channels, the most 
common type of calcium channels, are classified as 
15 T-, N-, or P-type channels, based on conductance 

levels, sensitivity to agonists and antagonists, and 
holding potential (K. Dunlap et al., Trends Neurosci. 
jg : 89-98 (1995)). Calcium channels contain two large 
subunits, al and a2, having molecular weights between 
20 about 130 and about 200 kDa, and one to three smaller 
subunits, such as b r and/or g subunits, each having a 
molecular weight that is usually less than about 60 
kDa, At least one of the large subunits is 
glycosylated, and a smaller subunit may be 
25 glycosylated as well. Subunit al is approximately 200 
to about 230 kDa, based on sodium dodecylsulf ate 
polyacrylamide gel electrophoresis (SDS-PAGE) . This 
subunit forms the pore through which calcium enters 
cells. Subunit a2 is approximately 160 to 190 kDa 
30 under non-reducing conditions on SDS-PAGE. The b 
subunit is about 52 to 65 kDa (SDS-PAGE); it is 
insensitive to reducing conditions- The g subunit, 
which is not observed in nervous tissue or in other 
certain preparations, is a glycoprotein of 
35 approximately 30 to 33 kDa (SDS-PAGE) . 

Investigation of particular calcium channel 
subtypes is rendered difficult by the presence of a 
mixture of different tissue-specific types of calcium 
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5 channels in cells. Study of particular subtypes is 
essential, however, because of the importance of 
intracellular calcium levels in contributing to vital 
cellular processes including neurotransmitter release, 
muscle contraction, pacemaker activity, and secretion 

10 of hormones and other substances. A need remains for 
identifying and studying individual calcium channel 
subtypes . 

Summary of 'the Invention 

The current invention pertains to the isolation 
15 and sequencing of nucleic acids encoding three 

subunits of human N-type calcium channel: an alB 
subunit, an a2d subunit, and a b3 subunit . Previously 
unknown alterations are present in the sequence of 
nucleotides encoding each of the subunits . The 
20 nucleic acid encoding the alB subunit has a change 

from G to A at position 194; a change from T to G at 
position 2559; a change from G to A at position 6470; 
and a deletion of nucleotides 4814-4819. The nucleic 
acid encoding the a2d subunit has a change from A to C 
25 at position 329; a change from G to C at position 

1191; a change from G to C at position 1219; a change 
from T to C at position 1596; a change from T to C at 
position 1980; a change from A to G at position 2090; 
and a change from A to G at position 3261. The 
30 nucleic acid encoding the b3 subunit has a change from 
C to G at position 4 6; a TCC insertion at position 
119; a change from A to T at position 203; a change 
from C to T at position 300; a change from C to G at 
position 303; a change from A to G at position 420; a 
35 change from C to T at position 438; a change from T to 
C at position 477; a change from T to G at position 
486; a chang from G to A at position 534; a change 
from A to C at position 552; a change from G to T at 
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5 position 561; an ATG insertion at position 978; a 

change from T to A at position 1064; a change from CG 
to GC at positions 1283-1284; or a change from C to T 
at position 1308. Certain of these changes result in 
amino acid alterations in the encoded proteins, while 
10 others are synonymous changes. 

Vectors containing the nucleic acids encoding the 
subunits described above have been prepared, as have 
host cells containing the nucleic acids. Methods of 
isolating nucleic acids encoding related calcium 
15 channel subunits, by employing hybridization of the 
nucleic acids of the invention to nucleic acid 
libraries, are now available by virtue of the 
discoveries described herein. Also available are the 
subunit proteins encoded by the nucleic acids, and 
20 also fusion proteins comprising the subunit proteins. 
Antibodies to the subunit proteins can also be 
generated. Assays to identify agents that modulate 
calcium channel activity are described, in which test 
cells are exposed to the agent to be tested and a 
25 calcium channel-selective ion; depolarizing the cell 
membrane of the test cell; detecting current flowing 
into the cell; and comparing the current to that of a 
control cell, wherein a difference in the current 
detected in the test cell, as compared with the 
30 current of the control cell, indicates that the agent 
modulates calcium channel activity. In addition, 
methods of diagnosing diseases associated with 
particular calcium channels, such as Lambert-Eaton 
syndrome, are described, using assays to detect the 
35 presence of calcium channel-specific antibodies in a 
sample from the individual suspected of having the 
disease. 
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5 The changes that are present in the subunits 

described herein may produce functional differences in 
the calcium channel formed by the three subunits, 
which will have an effect on the interaction between 
the calcium channels and agonists or antagonists of 
10 the channels. Furthermore, calcium channel subunits 
described herein are an advantageous combination, 
because the b3 subunit is normally found associated 
with an alB subunit in vivo/ thus, this combination 
closely resembles the calcium channel in vivo. 
15 Brief Description of the Drawings 

Figure 1 depicts the nucleic acid (SEQ ID NO. 1) 
and encoded amino acid (SEQ ID NO. 2) of a human alB 
subunit of the human calcium channel. 

Figure 2 depicts the nucleic acid (SEQ ID NO. 3) 
20 and encoded amino acid (SEQ ID NO. 4) of a human a2d 
subunit of the human calcium channel. 

Figure 3 depicts the nucleic acid (SEQ ID NO. 5) 
and encoded amino acid (SEQ ID NO. 6) of a human b3 
subunit of the human calcium channel. 
25 Figure 4 is a graphic representation of average 

counts per minute (CPM) for omega-conotoxin GVIA- 
sensitive potassium-stimulated calcium uptake in cells 
transfected with the three human calcium channel 
subunits . 

30 Detailed Description of the Invention 

The current invention pertains to the isolation 
and identification of DNA encoding subunits of a 
particular calcium channel." As described in the 
Examples below, cDNA encoding an alB subunit, an a2d 
35 subunit, and a b3 subunit of the human N-type calcium 
channel has been isolated. The cDNA clones have been 
inserted into expression* vectors, and are stably 
expressed in transformed cell lines. The resulting 
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5 transformed cells express each of the three mRNA 
encoding the individual subunits of the N-type 
channel. The transformed cells show omega-conotoxin 
GVIA binding activity, and omega-conotoxin GVIA toxin 
sensitive potassium-stimulated calcium uptake, 
10 indicating that the proteins expressed by the clones 
are capable of forming a functioning calcium channel. 

. As .a result of this discovery, nucleic acids 
encoding the three subunits, as well as vectors 
containing the nucleic a-eids encoding one or more of 
15 the subunits, host cells containing the nucleic acids 
encoding one or more of the subunits, and methods of 
isolating nucleic acids encoding related calcium 
channel subunits are now available. The subunit 
proteins, fusion proteins comprising the subunit 
20 proteins, and antibodies to the subunit proteins, as 
well as assays to identify agents that modulate 
calcium channel activity, are also described. Agents 
that modulate calcium channel activity can be used to 
treat certain central nervous system disorders by 
25 altering calcium channel activity. In addition, 
methods of diagnosing diseases associated with 
particular calcium channels, such as Lambert-Eaton 
syndrome, are described. 

A "nucleic acid encoding a calcium channel 
30 subunit", as used herein, is a. sequence of nucleotides 
which encodes either an alB subunit, an a2d subunit, 
or a b3 subunit of the N-type calcium channel. 
Nucleic acid encoding a calcium channel subunit can be 
either cDNA, DNA or mRNA. The nucleic acid encoding a 
35 calcium channel subunit is "isolated," indicating that 
it has been purified according to standard techniques 
known in. the art (for example, such as by techniques 
described by Sambrook et al . (eds) , in Molecular 
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5 Cloning: A Laboratory Manual, (2nd ed.), Cold Spring 
Harbor Laboratory Press (1989)). 

In one embodiment, nucleic acid encoding a 
calcium channel alB subunit has the sequence of 
nucleotides shown in Figure 1 (SEQ ID NO. 1) . In 
10 another embodiment, nucleic acid encoding a calcium 
channel alB subunit is a nucleic acid encoding a 
functional equivalent of the subunit encoded by 
sequence of nucleotides shown in Figure 1 (SEQ ID NO. 
1) . A "functional equivalent" has the same function 
15 as the calcium channel subunit, but is encoded by a 
nucleic acid thac may have minor variations in the 
sequence of nucleotides, in comparison to the nucleic 
acid encoding the subunit. A nucleic acid encoding a 
functional equivalent is referred to herein as an 
20 "equivalent" nucleic acid. Minor variations in 
equivalent nucleic acids include variations that 
result in no alteration of the encoded amino acid 
sequence (synonymous changes); variations that result 
in conservative amino acid substitutions in the 
25 encoded amino acid sequence; and/or minor deletions or 
insertions of nucleotides that do not alter the 
activity of the peptide. Such changes are readily 
known to the skilled artisan. For example, 
representative conservative amino acid changes 
30 include: alanine to glycine or serine; arginine to 

lysine; asparagine to glutamine or histidine; cysteine 
to serine; glutamine to asparagine; glutamate to 
aspartate; glycine to alanine or proline; histidine to 
asparagine or glutamine; isoleucine to leucine or 
35 valine; leucine to isoleucine or valine; lysine to 

arginine or glutamine; methionine to leucine tyrosine, 
or isoleucine; phenylalanine: to methionine, leucine to 
tyrosine; serine to threonine; threonine to serine; 
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5 tryptophan to tyrosine; tyrosine to tryptophan or 
phenylalanine; valine to isoleucine or leucine. 

A functional equivalent of a subunit has an 
activity that is equivalent to the subunit. 
"Activity" refers to the ability of the peptide to 
10 form a functional calcium channel with other necessary 
subunits. A functional calcium channel is a calcium 
channel that is able to provide for and regulate entry 
of calcium channel-selective ions, including calcium, 
ir. respon-5© to appropriate stimuli, and/or is able to 

15 bind ligands having affinity for the calcium channel. 
The activity of a calcium channel may be assessed in . 
vitro by standard methods, such as 

electrophysiological or other methods described below. 
Nucleic acids encoding a functional equivalent of 
20 a calcium channel alB subunit, include at least one of 
the following alterations, in comparison to the 
nucleic acid sequence in the HSCACHNT Genbank file 
(see Ellis, S.B. et al . , Science 257:389-395 (1992)): 
a change from G to A change at position 194; a change 
25 from T to G at position 2559; a change from G to A at 
position 6470; or a deletion of nucleotides 4814-4819. 
The nucleic acid having the sequence of nucleotides 
shown in Figure 1 (SEQ ID NO. 1), as well as nucleic 
acids encoding a functional equivalent of the subunit 
30 encoded by the sequence of nucleotides shown in Figure 
1 (SEQ ID NO. 1), are collectively referred to herein 
as a nucleic acid encoding a calcium channel alB 
subunit. 

A nucleic acid encoding a calcium channel a2d 
35 subunit has the sequence of nucleotides shown in 

Figure 2 (SEQ ID NO. 3), or a s quence of nucleotides 
encoding a functional equivalent of the subunit 
encoded by the nucleic acid shown in Figure 2 (SEQ ID 
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5 NO. 3) . A nucleic acid encoding a functional 

equivalent of the subunit encoded by the sequence of 
nucleotides shown in Figure 2 (SEQ ID NO. 3) includes 
at least one of the following alterations, in 
comparison to the HSCACNLB EMBL file (Harpold, M.M. et 
10 al., Neuron 8:71-84 (1992)): a change from A to C at 
position 329; a change from G to C at position 1191; a 
change from G to C at position 1219; a change from T 
to C at position 1596; a change from T to C at 
position 1980; a change from A to G at position 2090; 
15 or a change from A to G at position 3261. 

A nucleic acid encoding a calcium channel b3 
subunit has the sequence of nucleotides shown in 
Figure 3 (SEQ ID NO. 5), or the sequence of 
nucleotides encoding a functional equivalent of the 
20 subunit encoded by the nucleic acid shown in Figure 3 
(SEQ ID NO. 5) . A sequence of nucleotides encoding a 
functional equivalent of the subunit encoded by the 
nucleic acid shown in Figure 3 (SEQ ID NO. 5) includes 
at least one of the following alterations, in 
25 comparison to the HUMCALBA Genbank file (see Collin, 

T. et al., Eur. J. Biochem. 220 (1) : 257-262 (1994)): a 
change from C to G at position 4 6; a TCC insertion at 
position 119; a change from A to T at position 203; a 
change from C to T at position 300; a change from C to 
G at position 303; a change from A to G at position 
420; a change from C to T at position 438; a change 
from T to C at position 477; a change from T to G at 
position 48 6; a change from G to A at position 534; a 
change from A to C at position 552; a change from G to 
T at position 561; an ATG insertion at position 978; a 
change from T to A at position 1064; a change from CG 
to GC at positions 1283-1284; or a change from C to T 
at position 1308. 



30 



35 
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5 The nucleic acids encoding calcium channel 

subunits of the invention can be used to isolate other 
nucleic acids encoding related subunits. For example, 
all or a portion of one of the nucleic acids encoding 
a calcium channel subunit can be used as a probe to 
10 isolate nucleic acids from a nucleic acid library by 
hybridization techniques. A "portion" of the nucleic 
acid -indicates a part of the nucleic acid that 
contains one of the specific alterations described 
above. Stringency conditions should be tailored to 
15 eliminate hybridization of the probes to extraneous 
nucleic acid sequences (see Sambroolc ec al. (eds), 
Molecular Cloning: A Laboratory Manual , (2nd ed.), 
Cold Spring Harbor Laboratory Press (1989), 
particularly chapter 11.45). In a preferred 
20 embodiment, stringency conditions are selected such 
that the nucleic acid encoding a calcium channel 
subunit, or a portion of the nucleic acid, selectively 
hybridizes to a second nucleic acid (the target 
nucleic acid) . "Selective hybridization" indicates 
25 that the hybridization is of sufficient specificity to 
allow the target nucleic acid to be identified or 
isolated from other nucleic acids. Generally, medium 
stringency conditions will allow selective 
hybridization. Nucleic acids that encode a calcium 
30 channel subunit, and are capable of selectively 
hybridizing, under medium or high stringency 
conditions, to all or a portion of a nucleic acid 
encoding a calcium channel subunit of the invention, 
or to all or a portion of a nucleic acid encoding a 
35 functional equivalent of a calcium channel subunit of 
the invention, are also encompassed by the invention. 

Nucleic acids encoding, a calcium channel alB 
subunit, a calcium channel a2d subunit, or a calcium 
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5 channel b3 subunit, as described above, can be 

inserted into a vector to facilitate expression. The 
vector is capable of expressing nucleic acids that are 
in operative linkage with endogenous or exogenous 
regulatory sequences; it may be a plasmid, a phage, a 
10 virus, or other vector. The vector can contain other 
elements, such as transcriptional promoter elements, 
enhancer elements, splicing signals, termination and 
polyadenylation signals, viral replicons, and/or 
bacterial plasmid sequences. Upon introduction of the 
15 vector into a host cell, the nucleic acid inserted 

into the vector is expressed. The vector can contain 
more than one nucleic acid encoding a calcium channel 
subunit of the invention. For example, a vector can 
contain nucleic acids encoding a calcium channel alB 
20 subunit, as well as nucleic acids encoding a calcium 
channel a2d subunit. If only one subunit is used, it 
is understood that other known sequences are present 
to encode a functional protein. 

A vector containing the nucleic acids encoding a 
25 calcium channel subunit of the invention, as described 
above, can be transformed or transfected into an 
appropriate host cell for expression. Alternatively, 
a nucleic acid encoding a calcium channel subunit of 
the invention can be inserted directly into the host 
30 cell. The nucleic acids can be introduced into the 
cell in a manner such that they are integrated into 
the host cell genome; alternatively, they can be 
maintained episomally. 

Representative host cells include Escherichia 
35 coli, HEK 293 cells, Chinese hamster ovary (CHO) 
cells, African green monkey cells, mouse L cells, 
amphibian oocytes, and CHODUX cells (Mitchell, P.J. et 
al., Mol. Cell Biol. 6(6)1926-35 (1986)). In a 
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5 preferred embodiment, the host cell does not naturally 
contain nucleic acids encoding, or produce calcium 
channels comprising, alB, a2d, or b3 subunits, in 
order to facilitate distinguishing the nucleic acids 
and encoded subunits of the invention from other, 
10 native nucleic acids and subunits. In a more 

preferred embodiment, the host cell does not express 
or produce endogenous calcium channel subunits of the 
type, or in an amount, that substantially interferes 
■with detection of t-h-e nucleic acids and encoded 
15 subunits of the invention. 

A single al subunit is sufficient to form a 
calcium channel; therefore, at least the nucleic acid 
encoding the alB subunit is introduced into the host 
cell. In a preferred embodiment, nucleic acids 
20 encoding each of the three subunits are introduced 

into the host cell such that the host cell expresses - 
the subunits and includes one or more of them in 
membrane-spanning calcium channels. In a preferred 
embodiment, the host cell expresses functional calcium 
25 channels that are capable of controlling movement of 
calcium channel-selective ions and/or binding 
compounds. In a more preferred embodiment, the 
calcium channels are composed substantially or 
entirely of the three subunits encoded by the nucleic 
30 acids of the invention, in order to generate a calcium 
channel that is closer to the normal physiologic state 
of the channel in the mammalian central nervous 
system. A host cell which has been transformed or 
transfected as described above is also referred to 
35 herein as a transformed cell. 

Host cells transformed or transfected with 
nucleic acids encoding one or more of the calcium 
channel subunits of the invention can be used for 



WO 98/11131 



PCTAJS97/16146 



12 

5 screening for compounds that modulate calcium channel 
activity. Because the host cells have a homogeneous 
population of calcium channels, they provide a means 
to identify agents that specifically modulate activity 
of the particular calcium channels. An agent that 
10 modulates (e.g., enhances or upregulates, or inhibits 
or downregulates) calcium channel activity is an agent 
that affects the ability of the calcium channel to 
pass calcium channel-selective ions, or affects other 
detectable calcium channel characteristics, such as 
15 current kinetics. The agent may affect the calcium 
channel directly or indirectly. 

For example, transformed cells can be used in 
assays that identify agents that are agonists or 
antagonists of calcium channel activity. To identify 
20 agents that modulate calcium channel activity, a 

transformed cell (or a culture of transformed cells) , 
used as a "test" cell, is maintained in a solution 
containing an agent to be tested for its ability to 
modulate calcium channel activity (the test agent) and 
25 a calcium channel selective ion. A "calcium channel 
selective ion" refers to an ion that is capable of 
flowing through, or being blocked from flowing 
through, a calcium channel which spans a cellular 
membrane under conditions which would permit or block 
30 the flow of calcium ions. Ba 2 * is an example of a 

calcium channel selective ion. The cell membrane of 
the test cell is then depolarized, and current flowing 
into the test cell is detected. If the current that 
is detected is different from the current produced by 
35 depolarizing the same cell or a control cell in the 
presence of the same calcium channel selective ion, 
but in the absence of the compound, then the agent 
modulates calcium channel activity. In a preferred 
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10 



30 



35 



embodiment, the test cell is maintained at a holding 
potential which substantially inactivates calcium 
channels prior to the depolarization step. If the 
current is higher in the presence of the agent than in 
the absence of the agent, then the agent is an agent 
that enhances calcium channel activity (a calcium 
channel agonist) . If the current is lower in the 
presence of the agent than in the absence of the 
agent, then the agent is an agent that inhibits 
calcium channel activity (a calcium channel 

15 antagonist) . 

One "control" cell which can be used as described 
above, is a cell that is maintained in substantially 
the same manner as the test cell, with the exception 
that the control cell is not exposed to the agent to 
20 be tested. An alternative "control" cell is a cell 
which is identical to the test cell, except that it 
does not express functional calcium channels. 

Agents identified by these methods can be used to 
modulate activity of calcium channels in vivo. The in 
25 vitro assays described above should accurately predict 
relative efficacy of an agent as an agonist or an 
antagonist of calcium channels, since the calcium 
channel subunits described herein are subtype and 
tissue-specific. Specific disease targets include 
central nervous system disorders, including stroke, 
cerebral ischemia, epilepsy, chronic pain, head 
trauma, and other central nervous system diseases or 
conditions in which too much or too little 
neurotransmitter is released. The agent is 
administered by an appropriate route, such as orally, 
subcutaneously, transdermally , intravenously, 
intramuscularly, intraperitoneally , topically, 
rectally, vaginally, nasally, or via an implanted 
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5 reservoir. The agent can be administered in dosage 
formulations containing conventional, non-toxic, 
physiologically-acceptable carriers, adjuvants, and/or 
vehicles. The formulation in which the agent is 
administered will depend at least in part on the route 
10 by which it is administered. The agent is 

administered in an effective amount, which is that 
amount necessary to alleviate, reduce, eliminate, or 
prevent the symptoms associated with the disease, 
disorder or condition to be treated. More than one 
15 agent can be administered; if more than agent is used, 
the effective amount is that amount of the combination 
of agents that is necessary to alleviate, reduce, 
eliminate or prevent the symptoms associated with 
disease, disorder or condition. The effective amount 
20 will be determined on an individual basis, and will be 
based in part, on consideration of the particular 
agent, the individual's size and gender, the severity 
of the symptoms to be treated, the result sought, and 
the disease, disorder or condition to be treated. The 
25 effective amount can be administered in a series of 
doses separated by appropriate intervals, such as 
hours, days, or weeks. Alternatively, the effective 
amount can be administered as a sustained release 
dose, such as by a controlled-release dosage 
30 formulation. 

Purified proteins encoded by a nucleic acid 
encoding a calcium channel alB subunit,. a. calcium 
channel a2d subunit, or a calcium channel b3 subunit, 
as described above, are also described. The proteins 
35 (also referred to herein as calcium channel subunits 
of the invention) can be isolated from a host cell 
transfected or transformed with the nucleic acid 
encoding the subunit. Representative proteins include 



BNSDOCID: <WO 981 1 131A2_I_> 



WO 98/11131 



PCT/US97/16146 



15 

5 a calcium channel alB subunit having the amino acid 
sequence shown in Figure 1 (SEQ ID NO. 2); a calcium 
channel a2d subunit having the amino acid sequence 
shown in Figure 2 (SEQ ID NO. 4); or a calcium channel 
b3 subunit having the amino acid sequence shown in 
10 Figure 3 (SEQ ID NO. 6). 

Fusion proteins comprising the calcium channel 
alB subunit, a . calcium channel a2d subunit, or a 
calcium channel b3 subunit can also be generated using 
standard techniques. For example, a fusion nucleic 
15 acid can be generated by splicing, or attaching the 

nucleic acid encoding the calcium channel subunit to a 
nucleic acid encoding another protein or peptide (the 
fusion partner or protein), or by inserting the 
nucleic acid encoding the calcium channel subunit and 
20 the fusion protein into a common vector ; the fusion 

nucleic acid can then be transformed, transfected, or 
inserted into a hose cell for transcription and 
translation. 

Antibodies (or immunoglobulins) to the calcium 
25 channel subunit s of the invention can be generated. 

The term. "antibody", as used herein, encompasses both 
polyclonal and monoclonal antibodies, as well as 
mixtures of more than one antibody reactive with a 
calcium channel subunit of the invention (e.g., a 
30 cocktail of different types of monoclonal antibodies 

reactive with the mutant protein or protein fragment) . 
The term antibody is further, intended to encompass 
whole antibodies and/or biologically functional 
fragments thereof, chimeric antibodies comprising 
35 portions from more than one species, humanized 

antibodies, human-like antibodies, and bifunctional 
antibodies. Biologically functional antibody 
fragments which can.be used are those fragments 
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5 sufficient for binding of the antibody fragment to the 
calcium channel subunit of interest. Once the 
antibodies are raised, they are assessed for the 
ability to bind to the calcium channel subunit of 
interest. Conventional methods can be used to perform 
10 this assessment. Antibodies can also be raised to 
calcium channels formed by the combination of the 
calcium channel alB subunit, the calcium channel a2d 
subunit, and the calcium channel b3 subunit described 
herein. 

15 The chimeric antibodies can comprise portions 

derived from two different species (e.g., a constant 
region from one species and variable or binding 
regions from another species) . The portions derived 
from two different species can be joined together 

20 chemically by conventional techniques or can be 

prepared as single contiguous proteins using genetic 
engineering techniques. DNA encoding the proteins of 
both the light chain and heavy chain portions of the 
chimeric antibody can be expressed as contiguous 

25 proteins. 

Monoclonal antibodies (mAb) reactive with a 
calcium channel subunit of the invention, or a calcium 
channel formed by the subunits, can be produced using 
somatic cell hybridization techniques (Kohler and 
30 Milstein,' Nature 256: 495-4 97 ( 1975 ) ) or other 

techniques. In a typical hybridization procedure, 
purified calcium channel subunit, or calcium channels, 
can be used as the immunogen. An animal is immunized 
with the immunogen to obtain antibody-producing spleen 
35 cells. The species of animal immunized will vary 
depending on the specificity of mAb desired. The 
antibody producing cell is fused with an immortalizing 
cell (e.g., a myeloma cell) to create a hybridoma 
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5 capable of secreting antibodies. The unfused residual 
antibody-producing cells and immortalizing cells are 
eliminated. Hybridomas producing desired antibodies 
are selected using conventional techniques and the 
selected hybridomas are cloned and cultured. 
10 Polyclonal antibodies can be prepared by 

immunizing an animal in a similar fashion as described 
above for- the production of monoclonal antibodies. 
The animal is maintained under conditions whereby 
antibodies reactive with the calcium channel subunit 
15 of interest, or the calcium channel, are produced. 
Blood is collected from the animal upon reaching a 
desired titer of antibodies. The serum containing the 
polyclonal antibodies (antisera) is separated from the 
other blood components. The polyclonal antibody- 
20 containing serum can optionally be further separated 
into fractions of particular types of antibodies 
(e.g. , igG, igM) . 

Antibodies that are specific for the calcium 
channel subunits of the invention, or for the calcium 
25 channel formed by the three subunits of the invention, 
can be used for immunohistochemistry to monitor 
distribution and expression" density of the various 
subunits, or of the calcium channels themselves, in 
different tissues, including in normal and in diseased 
30 tissue. The antibodies can also be used as a 

therapeutic agent, .in order to modulate calcium 
channel activity, as described in detail above. 

Antibodies that are specific for the calcium 
channel subunits of the invention, or for the calcium 
35 channel formed by the three subunits of the invention, 
can also be used to facilitate diagnosis of Lambert- 
Eaton Syndrome (LES) . LES autoimmune disease is 
characterized by- insufficient release of acetylcholine 
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5 from motor nerve terminals which normally are 

responsive to nerve impulses. IgG from LES patients 
block individual voltage-dependent calcium channels 
and thus inhibit calcium channel activity (Kim and 
Neher, Science 239:405-8 (1988)). To diagnose LES, a 
10 test sample of blood or other bodily fluid is obtained 
from an individual suspected of having or carrying the 
disease. The test sample is contacted with a calcium 
channel subunit of the invention, or a calcium channel 
formed by the three subunits of the invention, under 
15 conditions which would allow any antibody which is 
specific for the calcium channel subunit or the 
calcium channel, and which may be present in the test 
sample, to bind. Binding of antibody to the calcium 
channel subunit of the invention, or the calcium 
20 channel formed by the subunits of the invention, if 
such binding exists, is then detected. The presence 
of binding indicates that the individual has 
antibodies to the calcium channel, and thus, is 
afflicted with LES. 
25 The invention is further illustrated, by the 

following Examples. 

Example 1 : 

Isolation and Expression of Clones for Human alB 
Subunit of N-Type Channel 
30 Sequences referred to herein are described in 

comparison to the sequence encoding an alB subchannel, 
described in the HSCACHNT data base (see Ellis, S.B. 
et al., Science 257:389-395 (1992)). 

A, Isolation of Primary Clones 
35 Three primary clones, each containing- a portion 

of the a 1B subunit, were obtained by hybridization 
under low stringency conditions with human, cerebellum 
library at 50 °C in 6X sodium chloride, -sodium citrate 
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5 (SCO overnight. Hybridized nucleic acids were washed 
using standard techniques (see Sambrook et al . (eds). 
Molecular Cloning: A Laboratory Manual, (2nd ed.), 
Cold Spring Harbor Laboratory Press (1989)). 

First, clone pN, which spans sequences 54-1405, 
10 was cloned into the Smal site of pSK- (Bluescript ) . 

The Sad site of the polylinker is at the 5' end. The 
Smal site was destroyed. pN has a single nucleotide 
change (G to A at position 194), which alters the 
amino acid at the corresponding position in the 
15 protein from gly to ser. 

The second clone, pM, spans sequences 978-4562. . 
EcoRI/blunt fragment was inserted into EcoRI/Smal 
sites of pSK-. The Kpnl site of the polylinker was at 
the 5' end. The Smal site was destroyed. pM has a . 
20 single nucleotide change <T to G at position 2559), 

which results in an alteration of the amino acid at : 
the corresponding position from leu to arg . 

Clone pC, which spans sequences 4105-7322, was 
also isolated. An EcoRI/Xbal fragment, was cloned into 
25 EcoRI/Xbal restricted pNK-CMV vector. The SacI site 
of the polylinker is at the 5 1 end. Clone pC has two 
changes: a single nucleotide change at position 6470 
(G to A) , resulting in an alteration of the 
corresponding amino acid from gly to ser; and a six 
30 base pair deletion at nucleic acid positions 48.14- 

4819, resulting in a deletion of the two corresponding 
amino acids (glu and thr) . 

B . Generation of pSK-Hal 
In order to generate a clone containing the entire alB 
35 cDNA, combination vectors, were made. First, pMCwas 
created. To allow cleavage by MamI, both pM and pC 
were transformed into SCS110 cells (Stratagene; dam-, 
dcm-), and the unmethylated DNA was isolated. pC was 
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5 digested with Xbal and MamI, and the 2930 bp fragment 
was isolated. pM was also digested with MamI and 
Xbal, and the 6.4 kb vector + insert band was 
isolated. These two fragments were ligated together 
to create pMC . The fusion point is the MamI site at 
10 position 4395. 

pNMC, a fusion of pN and pMC, was then generated. 
pN was restricted with Kpnl and Xbal, and the 1.4 kb 
insert was isolated. pMC was restricted with Xbal and 
partially restricted with Kpnl \ The 6012 Xbal/Kpnl 
15 fragment was isolated. These two fragments . were 

ligated together, and the ligation reaction was then 
cut with Xbal. The resulting fragments were then 
ligated to a pSK- vector restricted with Xbal. The 
resulting plasmid contains the entire coding sequence 
20 of al, with 91 bp 5' UT, and 157 bp 3' UT (pos 54- 

7322) . the fusion point is the Kpnl site at position 
1310. The Kpnl site of the pSK- polylinker is at the 
5' end. This pNMC vector is referred to as pSK-Hal. 
C . Expression Clones 
25 Two expression clones were generated. The first, 

pNK-Hal, was constructed by isolating the Xbal insert 
of pSK-Hal; this Xbal insert was inserted into the 
Xbal site of pNK-CMV. The polylinker SacI site is at 
the 5* end. 

30 A second expression clone, pNK-Hal-Koz, was also 

generated. A 5 f primer containing an optimized Kozak 
sequence (CCACCATGG) (SEQ ID NO. 7), an EcoRI site, 
and surrounding bases was synthesized. A 3' primer 
spanning the Bglll site at position 1463 was also 

35 synthesized. These primers were used to PCR the al 5 f 
end from pNK-Hal. This product was cut with EcoRI and 
Bglll, and cloned into a likewise restricted pNK-Hal 
plasmid. The resulting truncated plasmid contains the 
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10 



15 



20 



Kozak sequence and has been shown to express at least 
as well as the parent plasmid in transient 
transfection studies. 

The Hal gene thus includes several changes from 
the previously known sequence presented in the 
HSCACHNT Genbank file. A single nucleotide change (G 
to A at position 194), alters the amino acid at the 
corresponding position in the protein from gly to ser; 
a single nucleotide change (T to G at position 2559) , 
result* in an alteration of the amino acid at the 
corresponding position from leu to arg . Also, a 
single nucleotide change at position 6470 (G to A) , 
results in an alteration of the corresponding amino 
acid from gly to ser; and a six base pair deletion at 
nucleic acid positions 4814-4819, results in a 
deletion of the two corresponding amino acids (glu and 
thr) . These alterations are summarized in Table I. 



Table I: alB Alterations 



R Position 


Nucleotide Change 


Amino Acid Change | 


I 194 


G to A 


Gly to Ser 1 


| 2559 


T to G 


Leu to Arg 


1 6470 


G to A 


Gly to Ser 


I 4814-4819 


Deletion 6 bp 


Deletion Glu and Thr 



25 Example 2 ; 

Isolation and Expression of Clones for Human a2d 
Subunit of N-Tvpe Channel 
Sequences referred to herein are described in 
comparison to the sequence encoding an a2d subchannel, 
30 described in the HSCACNLB data base (Harpold, M.M. et 
al.r Neuron 8:71-84 (1992)). 
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5 a. Isolation of Primary Clones and Generation 

of clone pNK-Ha2 
The clone pNK-Ha2, was constructed as a fusion of 
two PGR clones. The template was human cerebellum 
QuickClone (Clonetech) cDNA, and the two PCR clones 

10 included sequences from 16-1409 (p2110) and from 1379- 
3313 (p2223) . They were isolated as T/A clones in 
PCRII (Invitrogen) with the polylinker NotI site at 
the 5' end of each clone. p2110 was restricted with 
EcoRI and Nsil, and p2223 was restricted with Nsil and 

15 KpnI. The gel-purified inserts were fused at the Nsil 
site (Pos. 1394) and cloned into an EcoRI/Kpnl 
restricted pNK-CMV vector. The SacI site is at the 5' 
end* 

This clone, pNK-Ha2, contained certain 
20 alterations from a previously identified a2 sequence 
described in the HSCACNLB EMBL file. These 
alterations are summarized in Table II. 



Table II: Alterations in H»2d 



Position 


Nucleotide Change 


Amino Acid M 
Change 1 


329 


A to C 


Ser to Arg | 


1191 


G to C 


Arg to Thr 1 


1 1219 


G to C 


Glu to Asp 


I 1596 


T to C 


Val to Ala 


I 1980 


T to C 


lie to Thr 


I 2090 


A to G 


Asn to Asp J 


I 3261 


A to G 


Val to Ala 



25 Expression Clones 

Two expression clones were generated. First, 
pNK-Ha2 was cut with KpnI and blunt ended. EcoRI 
linkers were attached. The reaction was treated with 
EcoRI, and the 3.3 kb EcoRI insert was isolated. This 
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EcoRI insert was ligated into two different vectors, 
pED, and pBabe-CMV, which had been restricted with 
EcoRI. The correct orientations were selected. The 
resultant expression clones are referred to herein as 
pBabe-Ha2 and pED-Ha2 . 



Example 3: 
Isolation and Expressio n of Clones for 
Human b 3 Subunit of N-Typ e Channel 
Sequences referred to herein are described in 
15 comparison to the sequence encoding an b3 subchannel, 
described in the HUMCALBA data base Genbank file (see 
Collin, T. et al . , Eur. J. Biochem. 220 (1) : 257-262 
(1994) ) . 

20 A^ Isolation of Primary Cl ones and 

Generation of Clone pNK-Hb3 
A blunt ended PCR product spanning sequences 21- 
1490 was cloned into the EcoRI site of pSK- . The 
template was human cerebellum QuickClone (Clonetech) 

25 cDNA, generating P SK-Hb3. The EcoRI/Kpnl insert of 
P SK-Hb3 was isolated and subsequently subcloned into 
the EcoRI /Kpnl sites of pNK-CMV. The Sad site of the 
polylinker is at the 5' end. This clone, pNK-Hb3, 
contained certain alterations from a previously 

30 identified b3 sequence described in the HUMCALBA 

Genbank file. These alterations are summarized in 
Table III. 
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5 Table III: Alfceratx hs in Hb3 



i Position 


Nucleotide Change 


Amino Acid Change jl 


1 46 


C to G 


Leu to Val | 


1 119 


TCC insertion 


Ser insertion | 


203 


A to T 


Glu to Val 1 


300 


C to T 


no change . Q 


303 


C to G 


no change U 


420 


A to G 


no change 


4 38 


C to T 


no change 


477 


T to C 


no change 


| 486 


T to G 


no change I 


1 534 


G to A 


no change II 


I 552 


A to C 


no change 1 


1 561 


G to T 


no change | 


\ 978 


ATG insertion 


Met insertion I 


1064 


T to A 


Leu to His | 


1283-4 


CG to GC 


Thr to Ser 


1308 


C to T 


no change 



Example 4 

Demonstration of Formation of Calcium Channels 
CHODUX cells (Mitchell, P.J. et al., Mol . Cell 
10 Bio. 6^6^:1926-35 (1986), Genbank M13476, M13477) were 
seeded on 100 mm dishes with 1 to 2 X 10 6 cells, at 24 
hours before transf ection. " Cells were transformed 
with each of the following three expression clones: 
pNK-Hal, pNK-Ha2, and pNK-Hb3, using CaP0« transf ection 
15 kit (Stratagene Mammalian Transf ection Kit) according 
to the manufacturer's instructions. Cells were then 
washed using a single PBS wash, being careful not to 
dislodge cells. Thirty to 35 mg DNA was used per 
plate, in an approximately 2:1:1 subunit ratio (20 mg 
20 alB, 7.5 mg a2d, 5 mg b3) . Transf ection was allowed 
to proceed for approximately 16 hours overnight. 

Saturating binding experiments were performed on 
the cells using a protocol as described by Harpold et 
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5 al. (Science 257:389-395 (1992)). Briefly, reaction 
tubes containing 125 I-omega-conotoxin GVIA (NEN) were 
prepared. The 12S I-omega-conotoxin GVIA is packaged in 
vials of 10 mCi at a specific activity of 2200 
Ci/tnmol. The vial is resuspended in 450 ml water, to 
10 obtain 10 nM. For a 200 P M (saturating) final 

concentration, 10ml/0.5 ml reaction was used. Cells 
were washed IX with PBS and resuspended with 1-2 ml 
binding buffer and BSA per plate by pipetting. 
w^ri^fc.Piv 500-000 cells (50-100 ml) per 0.5 ml 
15 reaction were used to initiate binding reaction. The 
reaction was allowed to proceed for 30-60 minutes at 
37 °C. Subsequently, 1 ml cold wash buffer + BSA was 
added, and cells were pelleted by 5 minutes at 2.8 
krpm at 4°C (Sorvall RT6000) . Cells were washed IX 
20 with 2 ml cold wash buffer + BSA and resuspended by 

gently vortexing; subsequently, they were repelleted. 
After aspiration of liquid, scintillation counting was 
performed. 

Results of the experiments demonstrated specific 
25 omega-conotoxin GVIA binding to whole cells. Specific 
binding indicates that the ^expression clones were 
expressed, and that the expressed subunits formed 
calcium channels. 

Example 5 

30 Demonstration of Inhibition of omega-Conotoxin GVIA- 
fiensitive Potassium-Stimulated C alcium Uptake 
The uptake of * 5 Ca into cells was performed by an 
adaptation of the method of Tan, K. and A.H. Tashjian 
(J. Biol. Chem. 259:418-426 (1984). The principle of 
35 the method involves activating ion permeation through 
synaptosomal calcium channels by high potassium- 
induced depolarization of the synaptosomal 
preparation. The uptake of 4S Ca measured by this 
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5 procedure is mediated by N-type calcium channels, and 
is sensitive to dihydropyridine, phenylalkylamine, and 
benzothiazipine Ca antagonists at therapeutically 
relevant concentrations (Tan and Tashjian, ibid-). 
Cells were transfected with the three human 
10 calcium channel subunits described herein. 

Transfected cells were suspended in 15 ml growth 
medium (Ham's F-10 medium plus 15% heat-inactivated 
horse serum and 2.5% heat-inactivated fetal bovine 
serum) . The cells were centrifuged, resuspended, and 
15 then added to T-75 flasks containing 12-15 mis growth 
medium, and incubated at 37 °C for approximately one 
week. The cells were then removed from the flask 
after dissociation from the walls of the flask by 
treatment for 5 minutes at 37 °C with 10 mM EDTA in 
20 phosphate buffered saline. The buffer was decanted, 

and the cells were resuspended in approximately 200 ml 
of growth medium. The cells were then aliquoted 200 
mg/well) into each well of several 96-well plates, and 
grown under the aforementioned conditions for 3-4 
25 weeks, with replacement of growth medium occurring 

twice per week. Cells were fed growth medium 24 hours 
before they are employed for 4S Ca uptake 
determinations . 

At the time of the assay, media was aspirated 
30 from each 96-well plate using a manifold designed to 
allow 50 mL of liquid to remain in each well. Each 
plate was washed arid aspirated twice with a low K+ 
buffer solution "LKHBBS" (in mM: 5 KC1, 145 NaCl, 10 
Hepes, 1 MgCl 2 , 0.5 CaCl 2 , 10 glucose, pH 7.4), 200 
35 ml/well. Each plate was incubated for 10 minutes at 
37°C, and aspirated as above. To each well of each 
plate, 50 ml of LKHBBS containing the agent in twice 
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15 



20 



the final concentration was added. The agents added 
are set forth in Table IV. 

Table IV: 

Agents Added to Transfectad Cells, Demonstrating 
omega-Conotoxin GVIA- Sensitive Potassium- Stimulated 

Calcium Op take 



25 



II Sample 


Cell Type 


Agent Added 


I 1 


HEK293 


Control (no agent) 


8 2 


HEK293 


7 5 mM KC1 


1 3 


T9 


Control (no agent) 


1 4 


T9 


74 mM KC1 


1 5 


T9 


SNX-111 (10 mM) 


8 6 


T9 


SNX-111 (10 mM) plus KCl (negative 
control) 


1 7 


T9 


gadolinium (10 mM) 


8 8 


T9 


gadolinium (10 mM) plus KCl (negative 
control) 



The plates were incubated for 10 minutes at room 
temperature. To each well of each plate, 50 ml of 
either of two solutions were added: (a) LKHBBS 
containing 1 mCi of carrier-free < 5 Ca, or (b) HKHBBS (a 
high K+ buffer containing 150 mM KCl and no NaCl, but 
otherwise identical to LKHIfBS) . 

Each plate was then incubated for 5 minutes at 
room temperature, aspirated as above, and quenched 
with 200 ml/well of Quench Buffer (Ca-free LKHBBS 
containing 10 mM Tris-EGTA) . Each plate was aspirated 
and rinsed with Quench Buffer a second time, then 
carefully aspirated to dryness. To each well of each 
plate 100 ml of High Safe II scintillation fluid 
(MicroScint (Packard)) was added. The plates were 
sealed, shaken, and subjected to scintillation 
spectrophotometry on a Microbeta 96-well Scintillation 
Counter (Wallac, Gaithersburg, MD, USA) . Averag 
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5 counts per minute (CPM) are plotted in Figure 4 

(numbering of data points corresponds to the numbering 
of agents in Table IV) . Results indicated that omega- 
conotoxin GVIA-sensitive potassium-stimulated calcium 
uptake occurred in the transfected cells, and was 

10 inhibited by the agents known to inhibit L-type 

calcium channels, thereby demonstrating that the three 
subunits formed a functioning calcium channel. 

Ecyu i va. 1 en fca 

15 Those skilled in the art will recognize, or be 

able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
embodiments of the invention described specifically 
herein. Such equivalents are intended to be 

20 encompassed in the scope of the following claims. 
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SEQUENCE LISTING 



(i) GENERAL INFORMATION: 

(i) APPLICANT: Franco, Rodrigo 
Sun Chen, Ai Ru 
Suey, David J. 

(ii) TITLE OF INVENTION: NUCLEIC ACID ENCODING HUMAN NEURONAL 
CALCIUM CHANNEL SUBUNITS 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hamilton, Brook, Smith £. Reynolds, ?.C. 

(B) STREET: Two Militia Drive 
'C5 CITY! Lsxing^on 

(D) STATE: MA 

(E) COUNTRY: USA 

{ F) ZIP: 02173-4799 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1-30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/713,118 
{B) FILING DATE: 16-SEP-1996 
(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Mata, Elizabeth W. 

(B) REGISTRATION NUMBER: 38,236 

(C) REFERENCE/ DOCKET NUMBER: 96188/KT 

tix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 617-861-6240 
<B> TELEFAX: 617-861-9540 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7266 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 92.. 7102 
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!xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TCCGTGGCT3 CTCCGCTCTG AGCGCCTGGC GCGCCCCGCG CCCTCCCTGC CGGGGCCGCT 

GGGCCGGGGA TGCACGCGGG GCCCGGGAGC C ATG GTC CGC TTC GGG GAC GAG 
112 

Met Val Arg Phe Gly Asp Giu 

CTG GGC GGC CGC TAT GGA GGC CCC GGC AGC GGA GAG CGG GCC CGG GGC 
160 

Leu Gly Gly Arg Tyr Gly Gly Pro Gly Ser Gly Glu Arg Ala Arg Gly 
10 15 20 

GGC GGG GCC GGC GGG GCG GGG GGC CCG GGT CCC GGG GGG CTG CAG CCC 
208 

Gly Gly Ala Gly Gly Ala Gly Gly Pro Gly Pro Gly Gly Leu Gin Pro 
25 • 30 35 

GGC CAG CGG GTC CTC TAC AAG CAA TCG ATC GCG CAG CGC GCG CGG ACC 
256 

Gly Gin Arg Val Leu Tyr Lys Gin Ser He Ala Gin Arg Ala Arg Thr 

40 45 50 55 

ATG GCG CTG TAC AAC CCC ATC CCG GTC AAG CAG AAC TGC TTC ACC GTC 
304 

Met Ala Leu Tyr Asn Pro He Pro Val Lys Gin Asn Cys Phe Thr Val 
60 65 70 

AAC CGC TCG CTC TTC GTC TTC AGC GAG GAC AAC GTC GTC CGC AAA TAC 
352 

Asn Arq Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val Arg Lys Tyr 
75 80 85 

GCG AAG CGC ATC ACC GAG TGG CCT CCA TTC GAG TAT ATG ATC CTG GCC 
400 

Ala Lvs Arg He Thr Glu Trp Pro Pro Phe Glu Tyr Met He Leu Ala 
90 95 100 

ACC ATC ATC GCC AAC TGC ATC GTG CTG GCC CTG GAG CAG CAC CTC CCT 
448 

Thr He He Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro 
105 HO I 15 

GAT GGG GAC AAA ACG CCC ATG TCC GAG CGG CTG GAC GAC ACG GAG CCC 
496 

Asp Gly Asp Lys Thr Pro Met Ser Glu Arg Leu Asp Asp Thr Glu Pro 

120 125 130 135 

TAT TTC ATC GGG ATC TTT TGC TTC GAG GCA GGG ATC AAA ATC ATC GCT 
544 

Tvr Phe He Gly lie Phe Cys Phe Glu Ala Gly He Lys lie lie Ada 

140 145 150 

CTG GGC TTT GTC TTC CAC AAG GGC TCT TAC CTG CGG AAC GGC TGG AAC 
592 

Leu Gly Phe Val Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn 
• 155 160 165 

GTC ATG GAC TTC GTG GTC GTC CTC ACA GGG ATC CTT GCC ACG GCT GGA 

Val Met Asp Phe Val Val Val Leu Thr Gly He Leu Ala Thr Ala Gly 
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170 "5 . ■ 180 

ACT GAC TTC SAC CTG CGA ACA C7G AGG GCT GTG CGT GTG CTG AGG CCC 

"J Asp Phe Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu Arc Pro 

185 190 195 

CTG AAG CTG GTG TCT GGG ATT CCA ACT TTG CAG GTG GTG CTC AAG TCC 

III Lys Leu Val Ser Gly He Pro Ser Leu Gin Val Val Leu Lys Ser 
200 205 210 

ATC ATG AAG GCC ATG GTT CCA CTC CTG CAG ATT GGG CTG CTT CTC TTC 

He Met Lys Ala Met Val Pro Leu Leu Gin He Gly Leu Leu Leu Phe 

220 225 ' JU 

TTT GCC ATC CTC ATG TTT GCC ATC ATT GGC CTG GAG TTC TAC ATG GGC 

Phe Ala lie Leu Met Phe Ala lie He Gly Leu Glu Phe Tyr Met Gly 

235 240 
AAG TTC CAC AAG GCC TGT TTC CCC AAC AGC ACA GAT GCG GAG CCC GTG 

Lys Phe His Lys Ala Cys Phe Pro Asn Ser Thr Asp Ala Glu Pro Val 
Y 250 255 260 

GGT GAC TTC CCC TGT GGC AAG GAG GCC CCA GCC CGG CTG TGC GAG GGC 

III Asp Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu Cys Glu Gly 
265 270 275 

GAC ACT GAG TGC CGG GAG TAC TGG CCA GGA CCC AAC TTT GGC ATC ACC 

Asp Thr Glu Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly lie Thr 
280 285 290 ^ 

AAC TTT GAC AAT ATC CTG TTT GCC ATC TTG ACG GTG TTC CAG TGC ATC 

Asn'phe Asp Asn He Leu Phe Ala He Leu Thr Val Phe Gin Cys lie 



300 



ACC ATG GAG GGC TGG ACT GAC ATC CTC TAT AAT ACA AAC GAT GCG GCC 

Th^Met Glu Gly Trp Thr Asp He Leu Tyr Asn Thr Asn Asp Ala Ala 
315 320 J " 

GGC AAC ACC TGG AAC TGG CTC TAC TTC t ATC CCT CTC ATC ATC ATC GGC 

£"°Asn Thr Trp Asn Trp Leu Tyr Phe He Pro Leu lie He lie Gly 
330 335 J * u 

TCC TTC TTC ATG CTC AAC CTG GTG CTG GGC GTG CTC TCG GGG GAG TTT 

si" Phe Phe Met Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe 

345 350 355 

GCC AAG GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC CTG AAG CTG 

A^Lys Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu 
360 365 370 
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CGC CGG CAG CAG CAG ATC GAG CGA GAG CTC AAC GGG 7AC C7G GAG TGG 
1264 

Ara Arq Gin Gin Gin lie Glu Arg Glu Leu Asn Gly Tyr -eu Giu Trp 
380 385 390 

ATC TTC AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG GAC AGG AAT GCA 
1312 

lie Phe Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asp Arg Asn Ala 
395 400 405 

GAG GAG AAG TCC CCT TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG 
1360 

Glu Glu Lys Ser Pro Leu Asp Val Leu Lys Arg Ala Ala Thr Lys Lys 
410 415 420 

AGC AGA AAT GAC CTG ATC CAC GCA GAG GAG GGA GAG GAC CGG TTT GCA 
1408 

Ser Arg Asn Asp Leu lie His Ala Glu Glu Gly Glu Asp Arg Phe Ala 
425 430 435 

GAT CTC TGT GCT GTT GGA TCC CCC TTC GCC CGC GCC AGC CTC AAG AGC 
1456 

Asd Leu Cys Ala Val Gly Ser Pro Phe Ala Arg Ala Ser Leu Lys Ser 

440 445 450 455 

GGG AAG ACA GAG AGC TCG TCA TAC TTC CGG AGG AAG GAG AAG ATG TTC 
1504 

Glv Lys Thr Glu Ser Ser Ser Tyr Phe Arg Arg Lys Glu Lys Met Phe 

460 465 470 

CGG TTT TTT ATC CGG CGC ATG GTG AAG GCT CAG AGC TTC TAC TGG GTG 
1552 

Arq Phe Phe He Arg Ara Met Val Lys Ala Gin Ser Phe Tyr Trp Val 
475 480 485 

GTG CTG TGC GTG GTG GCC CTG AAC ACA CTG TGT GTG GCC ATG GTG CAT 
1600 

Val Leu Cys Val Vai Ala Leu Asn Thr Leu Cys Val Ala Met Val His 
490 495 500 

TAC AAC CAG CCG CGG CGG CTT ACC ACG ACC CTG TAT TTT GCA GAG TTT 

Ty^Asn Gin Pro Arg Arg Leu Thr Thr Thr Leu Tyr Phe Ala Glu Phe 
505 510 515 

GTT TTC CTG GGT CTC TTC CTC ACA GAG ATG TCC CTG AAG ATG TAT GGC 
1696 

Val Phe Leu Gly Leu Phe Leu Thr Glu Met Ser Leu Lys Met Tyr Gly 
520 525 530 535 

CTG GGG CCC AGA AGC TAC TTC CGG TCC TCC TTC AAC TGC TTC GAC TTT 
1744 

Leu Gly Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Phe 

540 545 550 

GGG GTC ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG GCC ATC AAG 
1792 

Glv Val He Val Gly Ser Val Phe Glu Val Val Trp Ala Ala He Lys 
555 560 565 

CCG GGA AGC TCC TTT GGG ATC AGT GTG CTG CGG GCC CTC CGC CTG CTG 
1840 
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P^o Gly Ser Ser Phe Gly He Ser Val Leu Arg hla Leu Arg Leu Leu 
570 575 - BU 

AGG ATC TTC AAA GTC ACG AAG TAC TGG AGC TCC CTG C3G AAC CTG GTG 

Arg 8 He Phe Lys Val Thr Lys Tyr Trp Ser Ser Leu Arg Asn Leu Val 
585 590 595 

GTG TCC CTG CTG AAC TCC ATG AAG TCC ATC ATC AGC CTG CTC TTC TTG 

valuer Leu Leu Asn Ser Met Lys Ser lie lie Ser Leu Leu Phe Leu 
600 605 610 61S 

CTC TTC CTG TTC ATT GTG GTC TTC GCC CTG CTG GGG ATG CAG CTG TTT 

LeT Phe Leu Phe He Val Val Phe Ala Leu Leu Giy Met Gin Leu Phe 
620 625 OJU 

GGG GGA CAG TTC AAC TTC CAG GAT GAG ACT CCC ACA ACC AAC TTC GAC 

G?y 2 Gly Gin Phe Asn Phe Gin Asp Glu Thr Pro Thr Thr Asn Phe Asp 

ACC TTC CCT GCC GCC ATC CTC ACT GTC TTC CAG ATC CTG ACG GGA GAG 

T Sr°Phe Pro Ala Ala He Leu Thr Val Phe Gin lie Leu Thr Gly Glu 
650 655 660 

GAC TGG AAT GCA GTG ATG TAT CAC GGG ATC GAA TCG CAA GGC GGC GTC 

Asp 8 Tr P Asn Ala Val Met Tyr His Gly He Glu Ser Gin Gly Gly Val 
665 670 675 

AGC AAA GGC ATG TTC TCG TCC TTT TAC TTC ATT GTC CTG ACA CTG TTC 

Ser 6 Lys Gly Met Phe Ser Ser Phe Tyr Phe. lie Val Leu Thr Leu Phe 

GGA AAC TAC ACT CTG CTG AAT GTC TTT CTG GCC ATC GCT GTG GAC AAC 

Gly 4 Asn Tyr Thr Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn 
700 705 /1U 

CTG GCC AAC GCC CAA GAG CTG ACC AAG GAT GAA GAG GAG ATG GAA GAA 

"a 2 Ala Asn Ala Gin Glu Leu Thr Lys Asp Glu Glu Glu Met Glu Glu 
715 720 

GCA GCC AAT CAG AAG CTT GCT CTG CAA AAG GCC AAA GAA GTG GCT GAA 

A?!°Ala Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala <31u 
730 "5 74 ° 

GTC AGC CCC ATG TCT GCC GCG AAC ATC TCC ATC GCC GCC AGG CAG CAG 

VaTser Pro Met Ser Ala Ala Asn He Ser He Ala Ala Arg Gin Gin 
745 750 755 

AAC TCG GCC AAG GCG CGC TCG GTG TGG GAG CAG CGG GCC AGC CAG CTA 



Asn Ser Ala Lys Ala Arg Ser Val Trp Glu Gin Arg Ala Ser Gin Leu 

760 7 65 770 77:5 
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CGG CTG C.-.G AAC CTG CGG GCC AGC TGC GAG GCG CTG TAC AGC GAG ATG 

Aro^Leu Gin Asn Leu Arg Ala Ser Cys Glu Ala Leu Tyr Ser Glu Met 
* 780 "?85 790 

GAC CCC GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC CGG CGG CCC 
2512 

Asd Pro G-u Glu Arg Leu Ara Phe Ala Thr Thr Arg His Arg Arg Pro 

" 795 ' 800 805 

GAC ATG AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG CTG GGC CGC 

Asp°Met Lys Thr His Leu Asp Arg Pro Leu Val Val Glu Leu. Gly Arg 
810 815 820 

GAC GGC GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG 
2606 

Asp Gly Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala 

825 ■ 830 835 

GAG GCC CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC 
2656 

Glu Ala P-o Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg 

840 845 B50 855 

GAC AAG GAC AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC 

Asp 4 Lys Asd Lys Thr Pro Ala Ala Gly Asp Gin Asp Arg Ala Glu Ala 
* 860 865 870 

CCG AAG GCG GAG AGC GGG GAG CCC GGT GCC CGG GAG GAG CGG CCG CGG 

Pro 2 Lys Ala Glu Ser Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg 
875 880 885 

CCG CAC CGC AGC CAC AGC AAG GAG GCC GCG GGG CCC CCG GAG GCG CGG 
2800 

Pro His Arg Ser His Ser Lys Glu Ala Ala Giy Pro Pro Glu Ala Arg 
890 895 _ 900 

AGC GAG CGC GGC CGA GGC CCA GGC CCC GAG GGC GGC CGG CGG CAC CAC 
2848 

Ser Glu Arg Gly Arg Gly Pro Gly Pro Glu Gly Gly Arg Arg His His 
905 910 915 

CGG CGC GGC TCC CCG GAG GAG GCG GCC GAG CGG GAG CCC CGA CGC CAC 
2896 

Arg Arg Gly Ser Pro Glu Glu Ala Ala Glu Arg Glu Pro Arg Arg His 

920 925 930 935 

CGC GCG CAC CGG CAC CAG GAT CCG AGC AAG GAG TGC GCC GGC GCC AAG 
2944 

Arq Ala His Arg His Gin Asp Pro Ser Lys Glu Cys Ala Gly Ala Lys 
940 945 950 

GGC GAG CGG CGC GCG CGG CAC CGC GGC GGC CCC CGA GCG GGG CCC CGG 
2992 

Gly Glu Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro Arg 
955 960 965 
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GAG GCG GAG AGC GGG GAG GAG CCG GCG CGG CGG CAC CGG GCC CGG CAC 

3iu°Ala Glu Ser Gly Glu Glu Pro Ala Arg Arg His Arg Ala Arg His 
970 975 980 

AAG GCG CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC ACG GAG AAG 

^088 

*vs Ala Gin Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr Glu Lys 
985 990 995 

GAG GCC ACG GAG AAG GAG GCT GAG ATA GTG GAA GCC GAC AAG GAA AAG 

3136 

Glu Ala Thr Glu Lys Glu Ala Glu He Val Glu Ala Asp Lys Glu Lys 
10 00 1005 1010 1015 

GAG CTC CGG AAC CAC CAG CCC CGG GAG CCA CAC TGT GAC CTG GAG ACC 
? } ^ ^ . » n i.j ^ ~ ci - 5 rr, r. 1 m D*-o Hi c C\rs Ajso Leu Glu Thr 

tjiU uB'O. *rti-y nail w-.*. - - ~ * ^ -j ' * r\*zr\ 

1020 1025 1030 

AGT GGG ACT GTG ACT GTG GGT CCC ATG CAC ACA CTG CCC AGC ACC TGT 
3232 

Ser Gly Thr Val Thr Val Gly Pro Met His Thr Leu Pro Ser Thr Cys 
1035 1040 1045 

CTC CAG AAG GTG GAG GAA CAG CCA GAG GAT GCA GAC AAT CAG CGG AAC 
328 0 

Leu Gin Lys Val Glu Glu Gin Pro Glu Asp Ala Asp Asn Gin Arg Asn 
1050 1055 1060 

GTC ACT CGC ATG GGC AGT CAG CCC CCA GAC CCG AAC ACT ATT GTA CAT 

3328 

Val Thr Arg Met Gly Ser Gin Pro Pro Asp Pro Asn Thr He Val His 
1065 1070 1075 

ATC CCA GTG ATG CTG ACG GGC CCT CTT GGG GAA GCC ACG GTC GTT CCC 

111 Pro Val Met Leu Thr Gly Pro Leu Gly Glu Ala Thr Val Val Pro 
1080 1085 1090 1095 

AGT GGT AAC GTG GAC CTG GAA AGC CAA GCA GAG GGG AAG AAG GAG GTG 

Ser^Gly Asn Val Asp Leu Glu Ser Gin Ala Glu Gly Lys Lys Glu Val 
1100 H05 H10 

GAA GCG GAT GAC GTG ATG AGG AGC GGC CCC CGG CCT ATC GTC CCA TAC 
3472 

Glu Ala Asp Asp Val Met Arg Ser Gly Pro Arg Pro He Val Pro Tyr 
1115 1120 H25 

AGC TCC ATG TTC TGT TTA AGC CCC ACC AAC CTG CTC CGC CGC TTC T<3C 

Ser°Ser Met Phe Cys Leu Ser Pro Thr Asn Leu Leu Arg Arg Phe Cys 
1130 1135 H40 

CAC TAC ATC GTG ACC ATG AGG TAC TTC <3AG <3TG GTC ATT CTC GTG GTC 
3568 

His Tyr He Val Thr Met Arg Tyr Phe Glu Val Val He Leu Val Val 
1145 1150 H55 

ATC GCC TTG AGC AGC ATC GCC CTG GCT GCT GAG GAC CCA GTG CGC ACA 
3616 
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-le Ala Leu Ser Ser He Ala Leu Ala Ala Glu Asp Pro Val Arg Thr 
Il60 H6S 1115 

GAC TCG CCC AGG AAC AAC GCT CTG AAA TAC CTG GAT TAC ATT TTC ACT 

Asp 4 Ser Pro Arg Asn Asn Ala Leu Lys Tyr Leu Asp Tyr He Phe Thr 
HBO 1185 il90 

GGT GTC TTT ACC TTT GAG ATG GTG ATA AAG ATG ATC GAC TTG GGA CTG 

Glv Val Phe Thr Phe Glu Met Val He Lys Met He Asp Leu Gly Leu 
y 1195 1200 1205 

CTG CTT CAC CCT GGA GCC TAT TTC CGG GAC TTG TGG AAC ATT CTG GAC 

Le^Leu His Pro Gly Ala Tyr Phe Arg Asp Leu Trp Asn He Leu Asp 
1210 1215 1220 

TTC ATT GTG GTC AGT GGC GCC CTG GTG GCG TTT GCT TTC TCA GGA TCC 
3808 

°he He Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe Ser Gly Ser 

1225 1230 1235 

AAA GGG AAA GAC ATC AAT ACC ATC AAG TCT CTG AGA GTC CTT CGT GTC 

Lvs 6 Gly Lys Asp He Asn Thr He Lys Ser Leu Arg Val Leu Arg Val 
X240 1245 1250 1255 

CTG CGG CCC CTC AAG ACC ATC AAA CGG CTG CCC AAG CTC AAG GCT GTG 

Leu 4 Arg Pro Leu Lys Thr lie Lys Arg Leu Pro Lys Leu Lys Ala Val 
1260 1265 12'0 

TTT GAC TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC TTG ATT 

Phi 2 Asp Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn lie Leu He 
1275 1280 1285 

GTC TAC ATG CTC TTC ATG TTC ATA TTT GCC GTC ATT GCG GTG CAG CTC 

Vai°Tyr Met Leu Phe Met Phe He Phe Ala Val He Ala Val Gin Leu 
1290 1295 13°° 

TTC AAA GGG AAG TTT TTC TAC TGC ACA GAT GAA TCC AAG GAG CTG GAG 

Phe 8 Lys Gly Lys Phe Phe Tyr Cys Thr Asp Glu Ser Lys Glu Leu Glu 
1305 1310 1315 

AGG GAC TGC AGG GGT CAG TAT TTG GAT TAT GAG AAG GAG GAA GTG GAA 

Arg 6 Asp Cys Arg Gly Gln^Tyr Leu Asp Tyr Glu^Lys Glu Glu Val Glu^ 

GCT CAG CCC AGG CAG TGG AAG AAA TAC GAC TTT CAC TAC GAC AAT GTG 

Ala 4 Gin Pro Arg Gin Trp Lys Lys Tyr Asp Phe His Tyr Asp Asn Val 
1340 1345 I" 0 

CTC TGG GCT CTG CTG ACG CTG TTC ACA GTG TCC ACG GGA GAA GGC TGG 

Leu^Trp Ala Leu Leu Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp 
1360 1365 



1355 
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CCC ATG GTG CTG AAA CAC TCC GTG GAT GCC ACC TAT GAG GAG CAG GGT 

"J°Met Val Leu Lys His Ser Val Asp Ala Thr Tyr Glu Glu Gin Gly 
1370 1375 -380 

CCA AGC CCT GGG TAC CGC ATG GAG CTG TCC ATC TTC TAC GTG GTC TAC 

=ro%er Pro Gly Tyr Arg Met Glu Leu Ser lie Phe Tyr Val Val Tyr 
1385 1390 1395 

TTT GTG GTC TTT CCC TTC TTC TTC GTC AAC ATC TTT GTG GCT TTG ATC 
4336 

Phe Val Val Phe Pro Phe Phe Phe Val Asn He Phe Val Ala Leu lie 
140 0 1405 1410 1415 

ATC ATC ACC TTC CAG GAG CAG GGG GAC AAG GTG ATG TCT GAA TGC AGC 
4 38 

He Thr Phe 

1420 



lie" He Thr Phe Gin Glu Gin Gly Asp Lys Val Met Ser Glu Cys Ser 
1425 1430 

CTG GAG AAG AAC GAG AGG GCT TGC ATT GAC TTC GCC ATC AGC -GCC AAA ■ 

Leu Glu Lys Asn Glu Arg Ala Cys lie Asp Phe Ala He Ser Ala Lys 
1435 1440 1445 

CCC CTG ACA CGG TAC ATG CCC CAA AAC CGG CAG TCG TTC CAG TAT AAG 

Pro°Leu Thr Arg Tvr Met Pro Gin Asn Arg Gin Ser Phe Gin Tyr Lys 
1450 " 1455 1460 

ACG TGG ACA TTT GTG GTC TCC CCG CCC TTT GAA TAC TTC ATC ATG GCC 
4 528 

Thr Trp Thr Phe Val Val Ser Pro Pro Phe Glu Tyr Phe He Met Ala 
1465 1470 1475 

ATG ATA GCC CTC AAC ACT GTG GTG CTG ATG ATG AAG TTC TAT GAT GCA 

Mettle Ala Leu Asn Thr Val Val Leu Met Met Lys Phe Tyr Asp Ala 
1480 1485 _ 1490 1495 

CCC TAT GAG TAC GAG CTG ATG CTG AAA TGC CTG' AAC ATC GTG TTC ACA 
4624 

Tvr Glu Tyr 

1500 

TCC ATG TTC TCC ATG GAA TGC GTG CTG *. .G ATC ATC GCC TTT GGG GTG 
4 67 2 

Ser Met Phe Ser Met Glu Cys Val Leu Lys He He Ala Phe Gly Val 
1515 1520 1525 

CTG AAC TAT TTC AGA GAT GCC TGG AAT GTC TTT GAC TTT GTC ACT GTG 

leu°Asn Tyr Phe Arg Asp Ala Trp Asn Val Phe Asp Phe Val Thr Val 
1530 1535 1540 

TTG GGA AGT ATT ACT GAT ATT TTA GTA ACA GAG ATT GCG AAC AAT TTC 
47 68' 

Leu Gly Ser He Thr Asp He Leu Val Thr Glu He Ala Asn Asn Phe 
1545 1550 1555 



Pro Tyr Glu Tyr Glu Leu Met Leu. Lys Cys Leu Asn He Val Phe Thr 
y J 1505 1510 
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ATC AAC CTC AGC TTC CTC CGC CTC TTT CGA GCT GCG CGG CTG ATC AAG 

Ile^Asn Leu Ser Phe Leu Arq Leu Phe Arg Ala Ala Arg leu lie Lys 
1560 1565 1570 1575 

CTG CTC CGC CAG GGC TAC ACC ATC CGC ATC CTG CTG TGG ACC TTT GTC 

Leu 4 Leu Arg Gin Gly Tvr Thr He Arg lie Leu Leu Trp Thr Phe Val 
1580 * 1585 15S0 

CAG TCC TTC AAG GCC CTG CCC TAC GTG TGT CTG CTC ATT GCC ATG CTG 
4912 

Gin Ser Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu He Ala Met i-eu 

1595 1600 1605 

TTC TTC ATC TAC GCC ATC ATC GGC ATG CAG GTG TTT GGG AAT ATT GCC 

Phe°Phe He Tyr Ala He He Gly Met Gin Val Phe Gly Asn lie Ala 
1610 1615 1620 

CTG GAT GAT GAC ACC AGC ATC AAC CGC CAC AAC AAC TTC CGG ACG TTT 

Leu^Asp Asp Asp Thr Ser He Asn Arg His Asn Asn Phe Arg Thr Phe 
1625 1630 1635 

TTG CAA GCC CTG ATG CTG CTG TTC AGG AGC GCC ACG GGG GAG GCC TGG 

Leu Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp 
1640 1645 1650 1655 

CAC GAG ATC ATG CTG TCC TGC CTG AGC AAC CAG GCC TGT GAT GAG CAG 

Hi^Glu He Met Leu Ser Cys Leu Ser Asn Gin Ala Cys Asp Glu Gin 
1660 1665 1670 

GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TAC TTC TAC TTC GTC 

Ala^Asn Ala Thr Glu Cys Gly Ser Asp Phe Ala Tyr Phe Tyr Phe Val 
1675 1680 -685 

TCC TTC ATC TTC CTG TGC TCC TTT 'CTG ATG TTG AAC CTC TTT GTG GCT 

Ser°Phe He Phe Leu Cys Ser Phe Leu Met Leu Asn Leu Phe Val Ala 
1690 1695 1700 

GTG ATC ATG GAC AAT TTT GAG TAC CTC ACG CGG GAC TCT TCC ATC CTA 
5248 

Val He Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser He Leu 
1705 1710 1715 

GGT CCT CAC CAC TTG GAT GAG TTC ATC CGG GTC TGG GCT GAA TAC GAC 
5296 

Gly Pro His His Leu Asp Glu Phe He Arg Val Trp Ala Glu Tyr Asp 
1720 1725 1730 1735 

CCG GCT GCG TGT GGG CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG 
5344 

Pro Ala Ala Cys Gly Arg He Ser Tyr Asn Asp Met Phe Glu Met Leu 
1740 1745 1750 

AAA CAC ATG TCC CCG CCT CTG GGG CTG GGG AAG AAA TGC CCT GCT CGA 
5392 
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Lys His Met Ser Pro Pro Leu Gly Leu Gly Lys Lys Cys fro Ala Arg 
1 ^755 1760 ^765 

GTT GCT TAC AAG C3C CTG GTT CGC ATG AAC ATG CCC ATC TCC AAC GAG 

Vai°Ala Tvr Lys Arg Leu Val Arg Met Asn Met Pro lie Ser Asn Glu 
1770 J- 775 •' 8U 

GAC ATG ACT GTT CAC TTC ACG TCC ACG CTG ATG GCC CTC ATC CGG ACG 

Aso 8 Met Thr Val His Phe Thr Ser Thr Leu Met Ala Leu He Arg Thr 

1785 1 7 9° 1795 

GCA CTG GAG ATC AAG CTG GCC CCA GCT GGG ACA AAG CAG CAT CAG TGT 

"fLeu Glu lie Lys Leu Ala Pro Ala Gly Thr Lys Gin His Gin Cys 
1800 1805 1810 1815 

GAC GCG GAG TTG AGG AAG GAG ATT TCC GTT GTG TGG GCC AAT CTG CCC 

Aso\la Glu Leu Arg Lys Glu He Ser Val Val Trp Ala Asn Leu Pro 
1820 1825 i" u 

CAG AAG ACT TTG GAC TTG CTG GTA CCA CCC CAT AAG CCT GAT <5AG ATG 

"n 2 Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu Met 
1835 1840 1845 

ACA GTG GGG AAG GTT TAT GCA GCT CTG ATG ATA TTT GAC TTC TAC AAG 

Thr°Val Gly Lys Val Tyr Ala Ala Leu Met lie Phe Asp Phe Tyr Lys 
1850 1855 I860 

CAG AAC AAA ACC ACC AGA GAC CAG ATG CAG CAG GCT CCT GGA GGC CTC 

Gln 8 Asn Lys Thr Thr Arg Asp Gin Met Gin Gin Ala Pro Gly Gly Leu 
1865 1870 18 7 5 

TCC CAG ATG GGT CCT GTG TCC CTG TTC CAC CCT CTG AAG GCC ACC CTG 

S« 6 Gln Met Gly Pro Val Ser Leu Phe His Pro Leu Lys Ala Thr Leu 
1880 1885 1890 1895 

GAG CAG ACA CAG CCG GCT GTG CTC CGA GGA GCC CGG GTT TTC CTT CGA 

Glu 4 Gin Thr Gin Pro Ala Val Leu Arg Gly Ala Arg Val Phe Leu Arg 
1900 1905 191° 

CAG AAG AGT TCC ACC TCC CTC AGC AAT GGC GGG GCC ATA CAA AAC CAA 

Gin 2 Lys Ser Ser Thr Ser Leu Ser Asn Gly Gly Ala He Gin Asn <Un 
1915 1920 1925 

GAG AGT GGC ATC AAA GAG TCT GTC TCC TGG GGC ACT CAA AGG ACC CAG 

llu°Ser Gly He Lys Glu Ser Val Ser Trp Gly Thr Gin Arg Thr Gin 
1930 1935 I* 40 

GAT GCA CCC CAT GAG GCC AGG CCA CCC CTG GAG CGT GGC CAC TCC ACA 

CQCO 

Asp Ala Pro His Glu Ala Arg Pro Pro Leu Glu Arg Gly His Ser Thr 
1945 1950 1955 
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GAG ATC CCT GTG GGG CGG TCA GGA GCA CTG GCT GTG GAC CTT CAG ATG 
6016 

Glu lie Pro Val Gly Arg Ser Gly Ala Leu Ala Val Asp Val Gin Met 
I960 1965 1970 1975 

CAG AGC ATA ACC CGG AGG GGC CCT GAT GGG GAG CCC CAG CCT GGG CTG 
6064 

Gin Ser He Thr Arg Arg Gly Pro Asp Gly Glu Pro Gin Pro Gly Leu 
1980 1985 1990 

GAG AGC CAG GGT CGA GCG GCC TCC ATG CCC CGC CTT GCG GCC GAG ACT 
6112 

Glu Ser Gin Gly Arg Ala Ala Ser Met Pro Arg Leu Ala Ala Glu Thr 
1995 2000 2005 

CAG CCC GTC ACA GAT GCC AGC CCC ATG AAG CGC TCC ATC TCC ACG CTG 
6160 

Gin Pro Val Thr Asp Ala Ser Pro Met Lys Arg Ser He Ser Thr Leu 
2010 2015 2020 

GCC CAG CGG CCC CGT GGG ACT CAT CTT TGC AGC ACC ACC CCG GAC CGC 
6208 

Ala Gin Arg Pro Arg Gly Thr His Leu Cys Ser Thr Thr Pro Asp Arg 
2025 2030 2035 

CCA CCC CCT AGC CAG GCG TCG TCG CAC CAC CAC CAC CAC CGC TGC CAC 

6256 ^ 

Pro Pro Pro Ser Gin Ala Ser Ser His His His His His Arg Cys His 

2040 2045 2050 2055 

CGC CGC AGG GAC AGG AAG CAG AGG TCC CTG GAG AAG GGG CCC AGC CTG 
6304 

Ara Arq Arg Asp Arg Lys* Gin Arg Ser Leu Glu Lys Gly Pro Ser Leu 
* 2060 2065 2070 

TCT GCC GAT ATG GAT GGC GCA CCA AGC AGT GCT GTG GGG CCG GGG CTG 
6352 

Ser Ala Asp Met Asp Gly Ala Pro Ser Ser Ala Val Gly Pro Gly Leu 
2075 2080 2085 

CCC CCG GGA GAG GGG CCT ACA GGC TGC CGG CGG GAA CGA GAG CGC CGG 
6400 

Pro Pro Gly Glu Gly Pro Thr Gly Gys Arg Arg Glu Arg Glu Arg Arg 
2090 2095 2100 

CAG GAG CGG AGC CGG TCC CAG GAG CGG AGG CAG CCC TCA TCC TCC TCC 

Gl^Glu Arg Ser Arg Ser Gin Glu Arg Arg Gin Pro Ser Ser Ser Ser 
2105 . 2110 2115 

TCG GAG AAG CAG CGC TTC TAC TCC TGC GAC CGC TTT GGG GGC CGT GAG 

Ser^Glu Lys Gin Arg Phe Tyr Ser Cys Asp Arg Phe Gly Gly Arg Glu 
2120 2125 2130 2135 

CCC CCG AAG CCC AAG CCC TCC CTC AGC AGC CAC CCA ACG TCG CCA ACA 

Pro"* Pro Lys Pro Lys Pro Ser Leu Ser Ser His Pro Thr Ser Pro Thr 
2140 2145 2150 
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GCT GGC CAG GAG CCG GGA CCC CAC CCA CAG GGC AGT GGT TCC GTG AAT 

Alf 2 Gly Gin Glu Pro Gly Pro His Pro Gin Gly Ser Gly Ser Val Asn 
2155 2160 2165 

GGG AGC CCC T7G CTG TCA ACA TCT GGT GCT AGC ACC CCC GGC CGC GGT 

Glt°ser Pro Leu Leu Ser Thr Ser Gly Ala Ser Thr Pro Gly Arg Gly 
3 2170 2175 2180 

GGG CGG AGG CAG CTC CCC CAG ACG CCC CTG ACT CCC CGC CCC AGC ATC 

Gl^Arg Arg Gin Leu Pro Gin Thr Pro Leu Thr Pro Arg Pro Ser He 
Y 2185 2190 2195 

ACC TAC AAG ACG GCC AAC TCC TCA CCC ATC CAC TTC GCC GGG GCT CAG 

^nr 6 Tyr Lys Thr Ala Asn Ser Ser Pro lie His Ptee Ala Gly Ala Gin 
2200 2205 2210 2215 

ACC AGC CTC CCT GCC TTC TCC CCA GGC CGG CTC AGC C3T GGG CTT TCC 

Thr 4 Ser Leu Pro Ala Phe Ser Pro Gly Arg Leu Ser Arg Gly Leu Ser 
2220 2225 2230 

GAA CAC AAC GCC CTG CTG CAG AGA GAC CCC CTC AGC CAG CCC CTG GCC 
6832 

Glu His Asn Ala Leu Leu Gin Arg Asp Pro Leu Ser Gin Pro Leu Ala 
2235 2240 2245 

CCT GGC TCT CGA ATT GGC TCT GAC CCT TAC CTG GGG CAG CGT CTG GAC 
6880 

Pro Gly Ser Arg He Gly Ser Asp Pro Tyr Leu Gly Gin Arg Leu Asp 
2250 2255 2260 

AGT GAG GCC TCT GTC CAC GCC CTG CCT GAG GAC ACG CTC ACT TTC GAG 

Ser Glu Ala Ser Val His Ala Leu Pro Glu Asp Thr Leu Thr Phe Glu 
2265 2270 2275 

GAG GCT GTG GCC ACC AAC TCG GGC CGC TCC TCC AGG ACT TCC TAC GTG 

Glu 6 Ala Val Ala Thr Asn Ser Gly Arg Ser Ser Arg Thr Ser Tyr Val 
2280 2285 2290 2295 

TCC TCC CTG ACC TCC CAG TCT CAC CCT CTC CGC CGC GTG CCC AAC GGT 

Ser 4 Ser Leu Thr Ser Gin Ser His Pro Leu Arg Arg Val Pro Asn Gly 
2300 2305 2310 

TAC CAC TGC ACC CTG GGA CTC AGC TCG GGT GGC CGA GCA CGG CAC AGC 

Tyr His Cys Thr Leu Gly Leu Ser Ser Gly Gly Arg Ala Arg His Ser 
y 2315 2320 2325 

TAC CAC CAC CCT GAC CAA GAC CAC TGG TGC TAGCTGCACC GTGACCGCTC 
7122 

Tvr His His Pro Asp Gin Asp His Trp Cys 
2330 2335 

AGACGCCTGC ATGCAGCAGG CGTGTGTTCC AGTGGATGAG TTTTATCATC CACACGGGGC 
7182 
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AGTCGGCCC7 CGGGGGAGGC CTTGCCCACC . TTGGTGAGGC TCCTGTGGCC CCTCCCTCCC 
7242 

CCTCCTCCCC TCTTTTACTC TAGA 
7266 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2337 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
Met Val Arc Phe Glv Asp Glu Leu Gly Gly Arg Tyr Gly Gly Pro Gly 

^ in 1 S 



1 



10 15 



Ser Gly Glu Arg Ala Arg Gly Gly Gly Ala Gly Gly Ala Gly Giy Pro 
20 25 30 

Gly Pro Gly Gly Leu Gin Pro Gly Gin Arg Val Leu Tyr Lys Gin Ser 
35 40 45 

lie Ala Gin Arg Ala Arg Thr Met Ala Leu Tyr Asn Pro lie Pro Val 

50 55 60 . 

Lys Gin Asn Cys Phe Thr Val Asn Arg Ser Leu Phe Val Phe Ser Glu 
65 70 75 80 

Asp Asn Val Val Arg Lys Tyr Ala Lys Arg lie Thr Glu Trp Pro Pro 
K 85 90 95 

Phe Glu Tyr Met He Leu Ala Thr He He Ala Asn Cys He Val Leu 
100 105 HO 

Ala Leu Glu Gin His Leu Pro Asp Gly Asp Lys Thr Pro Met Ser Glu 
115 120 - I 25 

Arg Leu Asp Asp Thr Glu Pro Tyr Phe He Gly He Phe Cys Phe Glu 
130 135 140 

Ala Glv He Lys He He Ala Leu Gly Phe Val Phe His Lys Gly Ser 
145 r 150 155 160 

Tvr Leu Arg Asn Gly Trp Asn Val Met Asp Phe Val Val Val Leu Thr 
* 165 170 175 

Glv He Leu Ala Thr Ala Gly Thr Asp Phe Asp Leu Arg Thr Leu Arg 
180 185 190 

Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly He Pro Ser 
195 200 205 

Leu Gin Val Val Leu Lys Ser He Met Lys Ala Met Val Pro Leu Leu 
210 215 220 

Gin He Gly Leu Leu Leu Ph Phe Ala He L u M t Phe Ala He He 
225 230 235 240 
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Gly Leu Glu Phe Tyr Met Giy Lys Phe His Lys Ala Cys Phe Pro Asn 
3 245 250 255 

Ser "hr Asd Ala Glu Pro Val Gly Asp Phe Pro Cys Giy Lys Glu Ala 
- 265 270 



260 



=ro Ala Arg Leu Cys Glu Gly Asp Thr Glu Cys Arg Glu Tyr Trp Pro 

275 280 28:5 

Glv Pro Asn Phe Gly lie Thr Asn Phe Asp Asn lie Leu Phe Ala lie 

290 295 300 

Leu Thr Val Phe Gin Cys lie Thr Met Glu Gly Trp Thr Asp He Leu 

305 310 315 320 

Tyr Asn Thr Asn Asp Ala Ala Gly Asn Thr Trp Asn Trp Leu Tyr Phe 

■* O "5 C lift J jj 



330 335 



He Pro Leu He He He Gly Ser Phe Phe Met Leu Asn Leu Val Leu 
340 345 350 

Gly Val Leu Ser Gly Glu Phe Ala Lys Glu Arg Glu Arg Val Glu Asn 
Y 355 360 365 

Arq Arg Ala Phe Leu Lys Leu Arg Arg Gin Gin Gin He Glu Arg Glu 
370 375 380 

Leu Asn Gly Tyr Leu Glu Trp He Phe Lys Ala Glu Glu Val Met Leu 
385 390 395 400 

Ala Glu Glu Asp Arg Asn Ala Glu Glu Lys Ser Pro Leu Asp Val Leu 

4 10 4 13 



405 



Lys Arg Ala Ala Thr Lys Lys Ser Arg Asn Asp Leu He His Ala Glu 
Y 420 425 "30 

Glu Gly Glu Asp Arg Phe Ala Asp Leu Cys Ala Val Gly Ser Pro Phe 
435 440 445 

Ala Arg Ala Ser Leu Lys Ser Gly "Lys Thr Glu Ser Ser Ser Tyr Phe 
450 455 460 

Arg Arg Lys Glu Lys Met Phe Arg Phe Phe He Arg Arg Met Val Lys 
465 470 475 480 

Ala Gin Ser Phe Tyr Trp Val Val Leu Cys Val Val Ala Leu Asn Thr 
490 433 



485 



Leu Cys Val Ala Met Val His Tyr Asn Gin Pro Arg Arg Leu Thr Thr 
500 505 

Thr Leu Tyr Phe Ala Glu Phe Val Phe Leu Gly Leu Phe Leu Thr Glu 
515 520 525 

Met Ser Leu Lys Met Tyr Gly Leu Gly Pro Arg Ser Tyr Phe Arg Ser 
530 535 540 

Ser Phe Asn Cys Phe Asp Phe Gly Val lie Val Gly Ser Val Phe Glu 
545 550 555 

Val Val Trp Ala Ala lie Lys Pro Gly Ser Ser Phe Gly He Ser Val 
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565 570 575 

Leu Arg Ala Leu Arg Leu Leu Arg lie ?he Lys Val Thr Lys Tyr Trp 
580 585 590 

Ser Ser Leu Arg Asn Leu Val Val Ser Leu Leu Asn Ser Met Lys Ser 
595 600 605 

He Pe Ser Leu Leu Phe Leu Leu Phe Leu Phe He Val Val Phe Ala 
610 615 620 

Leu Leu Gly Met Gin Leu Phe Gly Gly Gin Phe Asn Phe Gin Asp Glu 

625 630 635 640 

Thr Pro Thr Thr Asn Phe Asp Thr Phe Pro Ala Ala He Leu Thr Val 

645 650 655 

Phe Gin He Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr His Gly 
660 665 670 

He Glu Ser Gin Gly Gly Val Ser Lys Gly Met Phe Ser Ser Phe Tyr 
675 680 685 

Phe He Val Leu Thr Leu Phe Gly Asn Tyr Thr Leu Leu Asn Val Phe 
690 695 700 

Leu Ala He Ala Val Asp Asn Leu Ala Asn Ala Gin Glu Leu Thr Lys 
705 710 715 720 

Asp Glu Glu Glu Met Glu Glu Ala Ala Asn Gin Lys Leu Ala Leu Gin 
725 730 735 

Lys Ala Lys Glu Val Ala Glu Val Ser Pro Met Ser Ala Ala Asn He 
740 745 750 

Ser He Ala Ala Arg Gin Gin Asn Ser Ala Lys Ala Arg Ser Val Trp 
755 760 765 

Glu Gin Arg Ala Ser Gin Leu Arg Leu Gin Asn Leu Arg Ala Ser Cys 
770 775 780 

Glu Ala Leu Tyr Ser Glu Met Asp Pro Glu Glu Arg Leu Arg Phe Ala 
785 790 795 800 

Thr Thr Arg His Arg Arg Pro Asp Met Lys Thr His Leu Asp Arg Pro 
805 810 815 

Leu Val Val Glu Leu Gly Arg Asp Gly Ala Arg Gly Pro Val Gly Gly 
820 825 830 

Lys Ala Arg Pro Glu Ala Ala Glu Ala Pro Glu Gly Val Asp Pro Pro 
835 840 845 

Arq Arg His His Arg His Arg Asp Lys Asp Lys Thr Pro Ala Ala Gly 
850 855 860 

Asp Gin Asp Arg Ala Glu Ala Pro Lys Ala Glu Ser Gly Glu Pro Gly 
865 870 875 880 

Ala Arq Glu Glu Arg Pro Arg Pro His Arg Ser His Ser Lys Glu Ala 
885 890 895 

Ala Gly Pro Pro Glu Ala Arg Ser Glu Arg Gly Arg Gly Pro Gly Pro 
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900 905 910 

31u Gly Gly Arg Arg His His Arg Arg Gly Ser Fro Glu Glu Ala Ala 
915 920 925 

Glu Arg Glu Pro Arg Arg His Arg Ala His Arg His Gin Asp Pro Ser 
930 935 »«0 

Lys Glu Cys Ala Gly Ala Lys Gly Glu Arg Arg Ala Arg His Arg Gly 
945 950 955 »«>u 

Gly Pro Arg Ala Gly Pro Arg Glu Ala Glu Ser Gly Glu Glu Pro Ala 

965 970 
Arg Arg His Arg Ala Arg His Lys Ala Gin Pro Ala His Glu Ala Val 
960 985 990 

Glu Lys Glu Thr Thr Glu Lys Glu Ala Thr Glu Lys Glu^Ala Glu lie 

995 1000 

Val Glu Ala Asp Lys Glu Lys Glu Leu Arg Asn His Gin Pro Arg Glu 

1010 ioi- - 020 

Pro His Cys Asp Leu Glu Thr Ser Gly Thr Val Thr Val Gly Pro Mjt 

1025 1030 IUJd 

His Thr Leu Pro Ser Thr Cys Leu Gin Lys Val Glu Glu Gin Pro Glu 
1045 1050 

Asp Ala Asp Asn Gin Arg Asn Val Thr Arg Met Gly Ser Gin Pro Pro 
1060 1065 l°' u 

Asp Pro Asn Thr He Val His He Pro Val Met Leu Thr Gly Pro Leu 
y 10 75 1080 1° 85 

Gly Glu Ala Thr Val Val Pro Ser Gly Asn Val Asp Leu Glu Ser Gin 
1090 1095 11°° 

Ala Glu Gly Lys Lys Glu Val Glu Ala Asp Asp Val Met Arg Ser Gly 
1105 111° 

Pro Arg Pro lie Val Pro Tyr Ser 'Ser Met Phe Cys Leu Ser Pro Thr 
1125 H 30 ii 

Asn Leu Leu Arg Arg Phe Cys His Tyr He Val Thr Met Arg Tyr Phe 
1140 H45 ll 50 

Glu Val Val He Leu Val Val He Ala. Leu Ser Ser lie Ala Leu Ala 
1155 H«0 ll 65 

Ala Glu Asp Pro Val Arg Thr Asp Ser Pro Arg Asn Asn Ala Leu Lys 
1170 H75 1 1B0 

Tyr Leu Asp Tyr lie Phe Thr Gly Val Phe Thr Phe Glu Met Val He 
1185 119P 1195 "0° 

Lys Met He Asp Leu Gly Leu Leu Leu His Pro Gly Ala Tyr Phe Arg 
* 1205 1210 1215 

Asp Leu Trp Asn lie Leu Asp Phe He Val Val Ser Gly Ala Leu Val 
* 1220 1225 1230 

Ala Phe Ala Phe Ser Gly Ser Lys Gly Lys Asp He Asn Thr He Lys 
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1235 1240 1243 

Ser Leu kra Val Leu Ara Val Leu Arg Pro Leu Lys Thr lie Lys Arg 
1250 * 1255 1260 

Leu Pro Lvs Leu Lys Ala Val Phe Asp Cvs Val Val Asn Ser Leu Lys 
1265 " 1270 1275 1280 

Asn Val Leu Asn lie Leu lie Val Tyr Met Leu Phe Met Phe He Phe 
1285 1290 1295 

Ala Val He Ala Val Gin Leu Phe Lys Gly Lys Phe Phe Tyr Cys Thr 

1300 1305 1310 

Asp Glu Ser Lys Glu Leu Glu Arg Asp Cys Arg Gly Gin Tyr Leu Asp 
1315 1320 1325 

Tyr Glu Lys Glu Glu Val Glu Ala Gin Pro Arg Gin Trp Lys Lys Tyr 
1330 1335 1340 

Asp Phe His Tyr Asp Asn Val Leu Trp Ala Leu Leu Thr Leu Phe Thr 
1345 1350 1355 1360 

Val Ser Thr Gly Glu Gly Trp Pro Met Val Leu Lys His Ser Val Asp 
1365 1370 1375 

Ala Thr Tyr Glu Glu Gin Gly Pro Ser Pro Gly Tyr Arg Met Glu Leu 
1380 1385 1390 

Ser He Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe Phe Val 
1395 1400 1405 

Asn He Phe Val Ala Leu Tie He He Thr Phe Gin Glu Gin Gly Asp 
1410 1415 1420 

Lys Val Met Ser Glu Cys Ser Leu Glu Lys Asn Glu Arg Ala Cys He 
1425 1430 1435 1440 

Asp Phe Ala He Ser Ala Lys Pro Leu Thr Arg Tyr Met Pro Gin Asn 
1445 1450 1455 

Arg Gin Ser Phe Gin Tyr Lys Thr Trp Thr Phe Val Val Ser Pro Pro 
1460 1465 1470 

Phe Glu Tyr Phe He Met Ala Met He Ala Leu Asn Thr Val Val Leu 
1475 1480 1485 

Met Met Lys Phe Tyr Asp Ala Pro Tyr Glu Tyr Glu Leu Met Leu Lys 
1490 1495 1500 

Cys Leu Asn He Val Phe Thr Ser Met Phe Ser Met Glu Cys Val Leu 
1505 1510 1515 1520 

Lys He He Ala Phe Gly Val Leu Asn Tyr Phe Arg Asp Ala Trp Asn 
1525 1530 1535 

Val Phe Asp Phe Val Thr Val Leu Gly Ser He Thr Asp lie Leu Val 
1540 1545 1550 

Thr Glu II Ala Asn Asn Phe He Asn Leu Ser Phe L u Arg Leu Phe 
1555 1560 1565 

Arg Ala Ala Arg Leu He Lys Leu Leu Arg Gin Gly Tyr Thr He Arg 
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1570 1575 1580 

He Leu Leu Tro Thr Phe Val Gin Ser Phe Lys Ala Leu Pro Tyr Val 
i585 * 1590 1595 1600 

Cvs Leu Leu He Ala Met Leu Phe Phe He Tyr Ala He He Gly Met 
1605 1610 1615 

Gin Val Phe Glv Asn He Ala Leu Asd Asp Asp Thr Ser He Asn Arg 
162C 1625 1630 

His Asn Asn Phe Arg Thr Phe Leu Gin Ala Leu Met Leu Leu Phe Arg 

1635 1640 1645 

Ser Ala Thr Gly Glu Ala Trp His Glu He Met Leu Ser Cys Leu Ser 
1650 1655 1660 

Asn Gin Ala Cys Asp Glu Gin Ala Asn Ala Thr Glu Cys Gly Ser Asp 
1665 1670 1675 1680 

Phe Ala Tvr Phe Tyr Phe Val Ser Phe He Phe Leu Cys Ser Phe Leu 
1685 1690 1695 

Met Leu Asn Leu Phe Val Ala Val He Met Asp Asn Phe Glu Tyr Leu 
1700 1705 1710 

Thr Arg Asp Ser Ser He Leu Gly Pro His His Leu Asp Glu Phe He 
1715 1720 1725 

Arg Val Trp Ala Glu Tyr Aso Pro Ala Ala Cys Gly Arg He Ser Tyr 
1730 1735 1740 

Asn Asp Met Phe Glu Met Leu Lys His Met Ser Pro Pro Leu Gly Leu 
1745 1750 1755 1760 

Glv Lvs Lvs Cys Pro Ala Arg Val Ala Tyr Lys Arg Leu Val Arg Met 
1765 1770 1775 

Asn Met °ro He Ser Asn Glu Asp Met Thr Val His Phe Thr Ser Thr 
1780 1785 1790 

Leu Met Ala Leu He Arg Thr Ala "Leu Glu He Lys Leu Ala Pro Ala 
1795 1800 1805 

Gly Thr Lys Gin His Gin Cys Asp Ala Glu Leu Arg Lys Glu He Ser 
1810 1815 1820 

Val Val Trp Ala Asn Leu Pro Gin Lys Thr Leu Asp Leu Leu Val Pro 
1825 1830 * 1835 1840 

Pro His Lys Pro Asp Glu Met Thr Val Gly Lys Val Tyr Ala Ala Leu 
1845 1850 1855 

Met He Phe Asp Phe Tyr Lys Gin Asn Lys Thr Thr Arg Asp Gin Met 
I860 1865 1870 

Gin Gin Ala Pro Gly Gly Leu Ser Gin Met Gly Pro Val Ser Leu Phe 
1875 1880 1885 

His Pro Leu Lys Ala Thr Leu Glu Gin Thr Gin Pro Ala Val Leu Ar^ 
1890 1895 1900 

Gly Ala Arg Val Phe Leu Arg Gin Lys Ser Ser Thr Ser Leu Ser Asn 
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1905 191C 1915 1920 

Glv Gly Ala lie Gin Asn Gin Glu Ser Gly He Lys Glu Ser Val Ser 
1925 1930 1935 

Trp Gly Thr Gin Arg Thr Gin Asp Ala Pro His Glu Ala Arg Pro Pro 
1940 1945 1950 

Leu Glu Arg Gly His Ser Thr Glu He Pro Val Gly Arg Ser Gly Ala 
1955 I960 1965 

Leu Ala Val Asp Val Gin Met Gin Ser lie Thr Arg Arg Gly Pro Asp 

1970 1975 1980 

Glv Glu Pro Gin Pro Gly Leu Glu Ser Gin Gly Arg Ala Ala Ser Met 
1965 1990 1995 2000 

Pro Arq Leu Ala Ala Glu Thr Gin Pro Val Thr Asp Ala Ser Pro Met 
^ 2005 2010 2015 

Lvs Arg Ser lie Ser Thr Leu Ala Gin Arg Pro Arg Gly Thr His Leu 
2020 2025 2030 

Cys Ser Thr Thr Pro Asp Arg Pro Pro Pro Ser Gin Ala Ser Ser His 
2035 2040 2045 

His His His His Arg Cys His Arg Arg Arg Asp Arg Lys Gin Arg Ser 
2050 2055 2060 

Leu Glu Lys Gly Pro Ser Leu Ser Ala Asp Met Asp Gly Ala Pro Ser 
2065 2070 2075 2080 

Ser Ala Val Gly Pro Gly Leu Pro Pro Gly Glu Gly Pro Thr Gly Cys 
2085 2090 2095 

Ara Arg Glu Arg Glu Arg Arg Gin Glu Arg Ser Arg Ser Gin Glu Arg 
y 2100 2105 2110 

Ara Gin Pro Ser Ser Ser Ser Ser Glu Lys Gin Arg Phe Tyr Ser Cys 
2115 2120 2125 

Aso Arq Phe Gly Gly Arg Glu Pro Pro Lys Pro Lys Pro Ser Leu Ser 
2130 2135 2140 

Ser His Pro Thr Ser Pro Thr Ala Gly Gin Glu Pro Gly Pro His Pro 
2145 2150 2155 2160 

Gin Gly Ser Gly Ser Val Asn Gly Ser Pro Leu Leu Ser Thr Ser Gly 
2165 2170 2175 

Ala Ser Thr Pro Gly Arg Gly Gly Arg Arg Gin Leu Pro Gin Thr Pro 
2180 2185 2190 

Leu Thr Pro Arg Pro Ser He Thr Tyr Lys Thr Ala Asn Ser Ser Pro 
2195 2200 2205 

He His Phe Ala Gly Ala Gin Thr Ser Leu Pro Ala Phe Ser Pro Gly 
2210 2215 2220 

Arq Leu Ser Arg Gly Leu Ser Glu His Asn Ala Leu Leu Gin Arg Asp 
2225 2230 2235 2240 

Pro Leu Ser Gin Pro Leu Ala Pro Gly Ser Arg He Gly Ser Asp Pro 
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2245 2250 2255 

-yr Leu Gly Gin Arg Leu Asp Ser Glu Ala Ser Val His Ala Leu Pro 
2260 2265 2270 

Glu Asp T hr Leu Thr Phe Glu Glu Ala Val Ala Thr Asn Ser Gly Arg 
221b 2290 2285 

Ser Ser Arg ^hr Ser Tyr Val Ser Ser Leu Thr Ser Gin Ser His Pro 
2290 2295 2300 

-eu Arg Arg Val Pro Asn Gly Tyr His Cys Thr Leu Gly Leu Ser Ser 
2305 2310 2315 2320 

Glv Gly Arg Ala Arg His Ser Tyr His His Pro Asp Gin Asp Hxs Trp 

Y Y 2325 2330 2335 

Cys 

;2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 20.. 3292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTGATCTTCG ATCGCGAAG ATG GCT GCT GGC TGC CTG CTG GCC TTG ACT CTG 

° 2 Met Ala Ala Gly Cys Leu Leu Ala Leu Thr Leu 

234D 234.5 

ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG TCG GAG GAG CCG" TTC 

Thr Leu Phe Gin Ser Leu Leu He Gly Pro Ser Ser Glu Glu Pro Phe 
2350 2355 2360 

CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT AAG ATG CAA GAA GAC 
148 

Pro Ser Ala Val Thr lie Lys Ser Trp Val Asp Lys Met Gin Glu Asp 
2365 2370 2375 2380 

CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC AAT CAG CTT GTT GAT 

1 96 

Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val Asn Gin Leu Val Asp 

2385 2390 2395 

ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG GAA CCA AAT AAT GCA 
244 

He Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val Glu Pro Asn Asn Ala 
2400 2405 2410 
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CGC CAG CTG G7A GAA ATT GCA GCC AGG GAT ATT GAG AAA CTT CTG AGC 
292 

Arg Gin Leu Val Glu lie Ala Ala Arg Asp He GIu Lys Leu Leu Ser 
2415 2420 2425 

AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG GAA GCG GAG AAA GTT 
340 

Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu Glu Ala du Lys Val 

2430 2435 2440 

CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA AGC AAT GAA GTT GTC 
368 

Gin Ala Ala His Gin Tro Arg Glu Asp Phe Ala Ser Asn Glu Val Val 
2445 2450 2455 2460 

TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG AAA AAT GAC AGT GAG 
436 

Tvr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu Lys Asn Asp Ser Glu 
2465 2470 2475 

CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT GAA GAT GCT AAT TTT 

Pro dv Ser Gin Arg lie Lys Pro Val Phe He Glu Asp Ala Asn Phe 
2480 2485 2490 

GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC CAT ATT CCT ACT GAC 
532 

Gly Arg Gin lie Ser Tyr Gin His Ala Ala Val His He Pro Thr Asp 
2495 2500 2505 

ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA CTC AAC TGG ACA AGT 
580 

He Tyr Glu Gly Ser Thr He Val Leu Asn Glu Leu Asn Trp Thr Ser 
2510 2515 2520 

GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG GAA GAC CCT TCA TTA 
628 

Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu Glu Asp Pro Ser Leu 

2525 ' 2530 2535 2540 

TTG TGG CAG GTT TTT GGC AGT GCC -ACT GGC CTA GCT CGA TAT TAT CCA 
676 

Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu Ala Arg Tyr Tyr Pro 
2545 2550 2555 

GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA AAT AAG ATT GAC CTT 
724 

Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro Asn Lys He Asp Leu 
2560 2565 2570 

TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA GGA GCT GCA TCT CCT 
772 

Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin Gly Ala Ala Ser Pro 
2575 2580 2585 

AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA AGT GTT AGT GGA TTG 
820 

Lys Asp Met Leu He Leu Val Asp Val Ser Gly Ser Val Ser Gly Leu 
2590 2595 2600 

ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA ATG TTA GAA ACC CTC 
868 
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Thr Leu Lys Leu lie Arg Thr Ser Val Ser Glu Me-. Leu Glu Thr Leu 
2605 2610 2615 2620 

TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT AAC AGC AAT GCT CAG 

Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe Asn Ser Asn Ala Gin 
2625 2630 2635 

GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA AAT GTA AGA AAT AAA 

964 

Asp Val Ser Cys Phe Gin His Leu Val Gin Ala Asn Val Arg Asn Lys 
---- 2645 2650 



2640 



AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA GCC AAA GGA ATT ACA 

L?s 2 Val Leu Lys Aso Ala Val Asn Asn He Thr Ala Lys Gly lie Thr 
2655 * 2660 2665 

GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA CAG CTG CTT AAT TAT 

Asp°Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu Gin Leu Leu Asn Tyr 
2670 2675 2680 

AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT ATG CTA TTC ACG GAT 

Asn 8 Val Ser Arg Ala Asn Cys Asn Lys He lie Met Leu Phe Thr Asp 
2685 2690 2695 2700 

GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC AAA TAC AAT AAA GAT 

Cicely Glu Glu Ara Ala Gin Glu He Phe Asn Lys Tyr Asn Lys Asp 
y y 2705 2710 2715. 

AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT CAA CAC AAT TAT GAG 

lys 4 Lys Val Arg Val Phe Arg Phe Ser Val Gly Gin His Asn Tyr Glu 
2720 2725 2730 

AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC AAA GGT TAT TAT TAT 

1252 

Arg Gly Pro He Gin Trp Met Ala "Cys Glu Asn Lys Gly Tyr Tyr Tyr 
2735 2740 2745 

GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT ACT CAG GAA TAT TTG 

Glu°ile Pro Ser He Gly Ala He Arg He Asn Thr Gin Glu Tyr Leu 
2750 2755 2760 

GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA GAC AAA GCT AAG CAA 

Asp 8 Val Leu Gly Arg Pro Met Val Leu Ala Gly Asp Lys Ala Lys Gin 
2765 2770 2775 2780 

GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG GAA CTG GGA CTT GTC 

1396 

Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu Glu Leu Gly Leu Val 
2785 2790 2795 

ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC GGC CAA TTT GAA AAT 

He 4 Thr Gly Thr Leu Pro Val Phe Asn He Thr Gly Gin Phe Glu Asn 
2800 2805 2810 
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AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT GTG ATG GGA GTA GAT 
1492 

Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly Val Met Gly Val Asp 
2815 2820 2825 

GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA CGT TTT ACA CTG TGC 
1540 

Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro Arg Phe Thr Leu Cys 
2830 2835 2840 

CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT GGT TAT GCT TTA TTA 
1588 

Pro Asn Gly Tyr Tyr Phe Ala He Asp Pro Asn Gly Tyr Ala Leu Leu 
2845 2850 2855 2860 

CAT CCA AAT CTT CAG CCA AAG AAC CCC AAA TCT CAG GAG CCA GTA ACA 
1636 

His Pro Asn Leu Gin Pro Lys Asn Pro Lys Ser Gin Glu Pro Val Thr 
2865 2870 2875 

TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT ATT AAA GTG GAG ATT 
1684 

Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp He Lys Val Glu He 
2880 2885 2890 

CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA AAA ACA TTC AGA ACT 
1732 

Arq Asn Lvs Met He Asp Gly Glu Ser Gly Glu Lys Thr Phe Arg Thr 
2895 2900 2905 

CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC AAA GGA AAC AGG ACA 
1780 

Leu Val Lys Ser Gin Asp* Glu Arg Tyr He Asp Lys Gly Asn Arg Thr 
2910 2915 2920 

TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC AGT TTG GCC TTG GTA 
1828 

Tvr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val 
2925 2930 _ 2935 2940 

TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC AAA CTA GAA GAG ACA 
187 6 

Leu Pro Thr Tyr Ser Phe Tyr Tyr He Lys Ala Lys Leu Glu Glu Thr 

2945 2950 2955 

ATA ACT CAG GCC AGA TCA AAA AAG GGC AAA ATG AAG GAT TCG GAA ACC 
1924 

He Thr Gin Ala Arg Ser Lys Lys Gly Lys Met Lys Asp Ser Glu Thr 
2960 2965 2970 

CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA CCA 
1972 

Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe He Ala Pro 
2975 2980 2985 

AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA TTT 
2020 

Ara Asp Tyr Cys Asn Asp Leu Lys He Ser Asp Asn Asn Thr Glu Phe 
2990 2995 3000 
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CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC CCA 

Leu 8 Leu Asn Phe Asn Glu Phe lie Asp Arg Lys Thr Pro Asn Asn Pro 
3005 3010 3015 3020 

TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC TTT 

Ser 6 Cys Asn Ala Asp Leu He Asn Arg Val Leu Leu Asp Ala Gly Phe 
3025 3030 3035 

ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC AAG 

Thr 4 Asn Glu Leu Val Gin Asn Tyr Trp Ser Lys Gin Lys Asn lie Lys 
3040 3045 305° 

GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA GTT 

Giv- 2 val Lys Ala Arg Phe Val val Thr Asp Gly Gly lie Thr Arg Val 
y 3055 3060 3065 

TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA TAT 

Tvr°pro Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr Tyr 
3070 3075 3080 

GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT TTC 

2308 

Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe 
3085 3090 3095 3100 

ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG <5GC 

?hr 6 Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly 
3105 3110 J 1 - 13 

ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT CTT 

Ue 4 Met Val Ser Lys Ala Val Glu lie Tyr lie Gin Gly Lys Leu Leu 
3120 3125 3130 

AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA GAG 

Lys 2 pro Ala Val Val Gly He Lys lie Asp Val Asn Ser Trp lie Glu 
3135 3140 3145 

AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT TGT 

Asn°Phe Thr Lys Thr Ser. lie Arg Asp Pro Cys Ala Gly Pro Val Cys 
3150 3155 3160 

GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT GAT 

Asp 8 Cys Lys Arg Asn Ser Asp Val Met Asp Cys Val He Leu Asp Asp 
3165 3170 3175 J1HU 

GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG ATT 

Gl^Gly Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn <31n He 
. . 3185 3190 3195 

GGA AGA TTT TTT GGA -GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG GTT 
2644 
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Gly Ara Phe Phe Gly Giu lie Asp Pro Ser Leu Met Arg His Leu Val 
3200 3205 3211 

AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA GTA 
2692 

Asn lie Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser Val 
3215 3220 3225 

TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA GCA 
2740 

Cys Glu Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arc Ser Ala 
3230 3235 3240 

TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC ACT 
2788 

Tyr Val Pro Ser Val Ala Asp lie Leu Gin lie Gly Trp Trp Ala Thr 
3245 3250 3255 3260 

GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC TTT 
2836 

Ala Ala Ala Trp Ser He Leu Gin Gin Phe Leu Leu Ser Leu Thr Phe 

3265 3270 3275 

CCA CGA CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG GCC 
2884 

Pro Arg Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala 
3280 3285 3290 

TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC TTC 
2932 

Ser Leu Ser Lys Gin Ser Cys He Thr Glu Gin Thr Gin Tyr Phe Phe 
3295 3300 3305 

GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC TGT 
2980 

Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys 
3310 3315 3320 

TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA TTC 

3028 , ^ 

Ser Arg He Phe His Gly Glu Lys "Leu Met Asn Thr Asn Leu He Phe 
3325 3330 3335 3340 

ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG CTC 
3076 

He Met Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu 
3345 3350 3355 

ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG GTT 

3124 „ , 

He Gin Ala Glu Gin Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val 
3360 3365 3370 

AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC AAT 
3172 

Lys Gin Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn Asn 
3375 3380 3385 

GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT CCC 
3220 

Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val S r Gly Leu Asn Pro 
3390 3395 3400 
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TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG C" 

32 68 

Ser Leu Trp Tyr lie lie Gly He Gin Phe Leu Leu Leu Trp Leu Val 
3405 3410 3415 3420 

TCT GGC AGC ACA CAC CGG CTG TTA TGACCT 
3298 

Ser Gly Ser Thr His Arg Leu Leu 
3425 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ 10 NO:4: 

Met Ala Ala Gly Cys Leu Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser 
1 5 10 15 

Leu Leu He Gly Pro Ser Ser Glu Glu Pro Phe Pro Ser Ala Val Thr 
20 25 30 

lie Lvs Ser Trp Val Asp Lys Met Gin Glu Asp Leu Val Thr Leu Ala 
35 40 45 

Lvs Thr Ala Ser Gly Val Asn Gin Leu Val Asp He Tyr Glu Lys Tyr 
50 55 60 

Gin Asp Leu Tyr Thr Val Glu Pro Asn Asn Ala Arg Gin Leu Val Glu 
65 70 75 80 

lie Ala Ala Arg Asp He Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala 
85 90 95 

Leu Val Ser Leu Ala Leu Glu Ala 'Glu Lys Val Gin Ala Ala His Gin 
100 105 110 

Trp Arg Glu Asp Phe Ala Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys 
H5 120 125 

Asd Asp Leu Asp Pro Glu Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg 
130 135 140 

He Lys Pro Val Phe He Glu Asp Ala Asn Phe Gly Arg Gin He Ser 
145 150 155 160 

Tyr Gin His Ala Ala Val His He Pro Thr Asp He Tyr Glu Gly Ser 
165 l^O 1'5 

Thr lie Val Leu Asn Glu Leu Asn Trp Thr Ser Ala Leu Asp Glu Val 
180 185 190 

Phe Lys Lys Asn Arg Glu Glu Asp Pro Ser Leu Leu Trp Gin Val Phe 
195 200 205 

Gly Ser Ala Thr Gly Leu Ala. Arg Tyr Tyr Pro Ala Ser Pro Trp Val 
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210 215 220 

Asp Asn Ser Ara Thr Pro Asn Lys He Asp Leu Tyr Asp Val Arg Arg 
225 ' 230 235 240 

Ara Pro Trp Tyr He Gin Gly Ala Ala Ser Pro Lys Asp Met Leu He 
245 250 255 

Leu Val Asp Val Ser Gly Ser Val Ser Gly Leu Thr Leu Lys Leu He 

260 265 270 

Arg Thr Ser Val Ser Glu Met Leu Glu Thr Leu Ser Asp Asp Asp Phe 

275 280 285 

Val Asn Val Ala Ser Phe Asn Ser Asn Ala Gin Asp Val Ser Cys Phe 
290 295 300 

Gin His Leu Val Gin Ala Asn Val Arg Asn Lys Lys Val Leu Lys Asp 
305 310 315 320 

Ala Val Asn Asn He Thr Ala Lys Gly He Thr Asp Tyr Lys Lys Gly 
325 330 335 

Phe Ser Phe Ala Phe Glu Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala 
340 345 350 

Asn Cys Asn Lys He He Met Leu Phe Thr Asp Gly Gly Glu Glu Arg 
355 360 365 

Ala Gin Glu He Phe Asn Lys Tyr Asn Lys Asp Lys Lys Val Arg Val 
370 375 380 

Phe Arq Phe Ser Val Gly Gin His Asn Tyr Glu Arg Gly Pro He Gin 
385 390 395 400 

Trp Met Ala Cys Glu Asn Lys Gly Tyr Tyr Tyr Glu He Pro Ser He 
405 410 415 

Glv Ala He Arg He Asn Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg 
420 425 430 

Pro Met Val Leu Ala Gly Asp Lys Ala Lys Gin Val Gin Trp Thr Asn 
435 440 445 

Val Tyr Leu Asp Ala Leu Glu Leu Gly Leu Val He Thr Gly Thr Leu 
450 455 460 

Pro Val Phe Asn He Thr Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys 
465 470 475 480 

Asn Gin Leu He Leu Gly Val Met Gly Val Asp Val Ser Leu Glu Asp 
485 490 495 

He Lys Arg Leu Thr Pro Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr 
500 505 510 

Phe Ala He Asp Pro Asn Gly Tyr Ala Leu Leu His Pro Asn Leu Gin 
515 520 525 

Pro Lys Asn Pro Lys Ser Gin Glu Pro Val Thr Leu Asp Phe Leu Asp 
530 535 540 

Ala Glu Leu Glu Asn Asp He Lys Val Glu He Arg Asn Lys Met He 
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545 550 555 560 

Asp Gly Glu Ser Giv Glu Lys Thr Phe Arg Thr Leu Val lys Ser Gin 
^ 1 - ~ - 570 3 



565 



Asp Glu Ara Tyr lie Asp Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro 
585 =90 



580 



Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser 
595 600 605 

Phe Tvr Tyr He Lys Ala Lys Leu Glu Glu Thr lie Thr Gin Ala Arg 
610 615 620 

Ser Lys Lys Gly Lys Met Lys Asp Ser Glu Thr Leu Lys Pro Asp Asn 

625 630 635 

r-i .. »Kr Dha Tie aj.3 Pro Ara Asd Tyr Cys Asn 
rne oxu o=x --- - - 655 

Asp Leu Lys lie Ser Asp Asn Asn Thr Glu Phe Leu Leu Asn Phe Asn 
P 660 665 6-7° 

Glu Phe lie Asp Arg Lys Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp 
675 680 685 

Leu lie Asn Arg Val Leu Leu Asp Ala Gly Phe Thr Asn Glu Leu Val 
690 695 700 

Gin Asn Tyr Trp Ser Lys Gin Lys Asn He Lys Gly Val Lys Ala Arg 
705 710 715 7ZO 

Phe Val Val Thr Asp Gly Gly He Thr Arg Val Tyr Pro Lys Glu Ala 
725 730 735 

Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr Tyr Glu Asp Ser Phe Tyr 
- - - 745 /3U 



740 



Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe 

y 755 760 _ 765 

Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly lie Met Val Ser Lys 

770 775 780 

Ala Val Glu He Tyr He Gin Gly Lys Leu Leu Lys Pro Ala Val Val 

785 790 795 800 

Gly He Lys He Asp Val Asn Ser Trp lle Glu Asn Phe Thr Lys Thr 

1 J 810 815 



805 



Ser lie Arg Asp Pro Cys Ala Gly Pro Val Cys Asp Cys Lys Arg Asn 
820 825 »30 

Ser Asp Val Met Asp Cys Val He Leu Asp Asp Gly Gly Phe Leu Leu 
835 840 8 

Met Ala Asn His Asp Asp Tyr Thr Asn Gin He Gly Arg Phe Phe Gly 
850 855 860 

Glu He Asp Pro Ser Leu Met Arg His Leu Val Asn He Ser Val Tyr 
865 870 875 880 
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Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser Val Cys Glu Pro Gly Ala 
885 B90 995 

Ala Pro Lvs Gin Gly Ala Gly His Arg Ser Ala Tyr Val Pro Ser Val 
" 900 905 910 

Ala Psp T "° Leu Gin He Gly Tro TrD Ala Thr Ala Ala Ala Trp Ser 
915 92*0 925 

He Leu Gin Gin Phe Leu Leu Ser Leu Thr Phe Pro Arg Leu Leu Glu 
930 935 940 

Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala Ser Leu Ser Lys Gin 
945 9S0 955 960 

Ser Cvs I^e Thr Glu Gin Thr Gin Tyr Phe Phe Asp Asn Asp Ser Lys 
965 970 975 

Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys Ser Arg He Phe His 
980 985 990 

Gly Glu Lys Leu Met Asn Thr Asn Leu He Phe He Met Val Glu Ser 
995 1000 1005 

Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu He Gin Ala Glu Gin 
1010 1015 1020 

Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val Lys Gin Pro Arg Tyr 
1025 1030 1035 1040 

Ara Lvs Gly Pro Asp Val Cys Phe Asp Asn Asn Val Leu Glu Asp Tyr 
* 1045 1050 1055 

Thr Asp Cys Gly Gly Val Ser Gly Leu Asn Pro Ser Leu Trp Tyr He 
men 1065 1070 



15 



He Gly He Gin Phe Leu Leu Leu Trp Leu Val Ser Gly Ser Thr Hi 
1075 1080 1085 

Arg Leu Leu 
1090 

(2) INFORMATION FOR SEQ ID NO: 5: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 8.. 14 59 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTCCCCC ATG TAT GAC GAC TCC TAC GTG CCC GGG TTT GAG GAC TCG GAG 
49 
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Met Tyr Asp Asp Ser Tyr Val Pro Gly Phe Glu Asp Ser Glu 
1095 1100 1105 

GCG GGT TCA GCC GAC TCC TAC ACC AGC CGC CCA TCT CTG GAC TCA GAC 
97 

a Glv Ser Ala 

1110 

GTC TCC CTG GAG GAG GAC CGG GAG AGT GCC CGG CGT GAA GTA GAG AGC 

Val Ser Leu Glu Glu Asp Arg Glu Ser Ala Arg Arg Glu Val Glu Ser 
1125 H30 H35 

CAG GCT CAG CAG CAG CTC GAA AGG GCC AAG CAC AAA CCT GTG GCA TTT 

Gin Ala Gin Gin Gin Leu Glu Arg Ala Lys His Lys Pro Val Ala Phe 
1140 1145 II 50 

GCG GTG AGG ACC AAT GTC AGC TAC TGT GGC GTA CTG GAT GAG GAG TGC 

241 

Ala Val Arg Thr Asn Val Ser Tyr Cys Gly Val Leu Asp Glu Glu Cys 
1155 H60 1165 

CCA GTC CAG GGC TCT GGA GTC AAC TTT GAG GCC AAA GAT TTT CTG CAC 

Pro Val Gin Gly Ser Gly Val Asn Phe Glu Ala Lys Asp Phe Leu His 
H70 H75 HBO H85 

ATT AAA GAG AAG TAC AGC AAT GAC TGG TGG ATC GGG CGG CTA GTG AAA 

III Lys Glu Lys Tyr Ser Asn Asp Trp Trp He Gly Arg Leu Val Lys 
1190 1195 1200 

GAG GGC GGG GAC ATC GCC TTC ATC CCC AGC CCC CAG CGC CTG GAG AGC 

Glu Gly Gly Asp He Ala Phe He Pro Ser Pro Gin Arg Leu Glu Ser 
1205 1210 1215 

ATC CGG CTC AAA CAG GAG CAG AAG GCC AGG AGA TCT GGG AAC CCT TCC 

lie Arg Leu Lys Gin Glu Gin Lys Ala Arg Arg Ser Gly Asn Pro Ser 
1220 1225 1230 

AGC CTG AGT GAC ATT GGC AAC CGA CGC TCC CCT CCG CCA TCT CTA GCC 
481 

Ser Leu Ser Asp He Gly Asn Arg Arg Ser Pro Pro Pro Ser Leu Ala 
1235 1240 1245 

AAG CAG AAG CAA AAG CAG GCG GAA CAT GTT CCC CCA TAT GAC GTG GTG 



529 

Lys Gin Lys Gin Lys Gin Ala Glu His Val Pro Pro Tyr Asp Val Val 
1250 1255 1260 1265 

CCC TCC ATG CGG CCT GTG GTG CTG GTG GGA CCC TCT CTG AAA GGT TAT 

577 

Pro Ser Met Arg Pro Val Val Leu Val Gly Pro Ser Leu Lys Glv "Tyr 
1270 1275 1280 

GAG GTC ACA GAC ATG ATG CAG AAG GCT CTC TTC GAC TTC CTC AAA CAC 

625 

Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu Lys His 
1285 1290 1295 
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AGA TTT GAT GGC AGG ATC TCC ATC ACC CGA GTC ACA GCC GAC CTC TCC 
€73 

Arg Phe Asp Gly Arg He Ser lie Thr Arg Val Thr Ala Asp Leu Ser 
1300 1305 1310 

CTG GCA AAG CGA TCT GTG CTC AAC AAT CCG GGC AAG AGG ACC ATC ATT 
721 

Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Gly Lys Arg Thr He He 
1315 1320 1325 

GAG CGC TCC TCT GCC CGC TCC AGC ATT GCG GAA GTG CAG AGT GAG ATC 
769 

Glu Arg Ser Ser Ala Arg Ser Ser He Ala Glu Val Gin Ser Glu He 
1330 1335 1340 1345 

GAG CGC ATA TTT GAG CTG GCC AAA TCC CTG CAG CTA GTA GTG TTG GAC 
817 

Glu Arg He Phe Glu Leu Ala Lys Ser Leu Gin Leu Val Val Leu Asp 

1350 1355 1360 

GCT GAC ACC ATC AAC CAC CCA GCA CAG CTG GCC AAG ACC TCG CTG GCC 
865 

Ala Asp Thr He Asn His Pro Ala Gin Leu Ala Lys Thr Ser Leu Ala 
1365 1370 1375 

CCC ATC ATC GTC TTT GTC AAA GTG TCC TCA CCA AAG GTA CTC CAG CGT 
913 

Pro He He Val Phe Val Lys Val Ser Ser Pro Lys Val Leu Gin Arg 

1380 1385 1390 

CTC ATT CGC TCC CGG GGG AAG TCA CAG ATG AAG CAC CTG ACC GTA CAG 

Leu He Arg Ser Arg Gly Lys Ser Gin Met Lys His Leu Thr Val- Gin 
1395 1400 1405 

ATG ATG GCA TAT GAT AAG CTG GTT CAG TGC CCA CCG GAG TCA TTT GAT 
1009 

Met Met Ala Tyr Asp Lys Leu Val Gin Cys Pro Pro Glu Ser Phe Asp 

1410 1415 1420 1425 

GTG ATT CTG GAT GAG AAC CAG CTG GAG GAT GCC TGT GAG CAC CTG GCT 
1057 

Val He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His Leu Ala 
1430 1435 1440 

GAG TAC CTG GAG GTT TAC TGG CGG GCC ACG CAC CAC CCA GCC CCT GGC 
1105 . ^ wl n 

Glu Tyr Leu Glu Val Tyr Trp Arg Ala Thr His His Pro Ala Pro Gly 
1445 1450 1455 

CCC GGA CTT CTG GGT CCT CCC AGT GCC ATC CCC GGA CTT CAG AAC CAG 

Pro Gly Leu Leu Gly Pro Pro Ser Ala He Pro Gly Leu Gin Asn Gin 
1460 1465 1470 

CAG CTG CTG GGG GAG CGT GGC GAG GAG CAC TCC CCC CTT GAG CGG GAC 

1201 _ n 

Gin Leu Leu* Gly Glu Arg Gly Glu Glu His Ser Pro Leu Glu Arg Asp 
1475 1480 1485 

AGC TTG ATG CCC TCT GAT GAG GCC AGC GAG AGC TCC CGC CAA GCC TGG 
1249 
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Ser Leu Met Pro Ser Asp^Glu Ala Ser Glu Server Arg Gin Ala Tr^ 

ACA GGA TCT TCA CAG CGT AGC TCC CGC CAC CTG GAG GAG GAC TAT GCA 

Tartly Ser Ser Gin Arg Ser Ser Arg His Leu Glu Glu Asp Tyr Ala 
1510 1515 

GAT GCC TAC CAG GAC CTG TAC CAG CCT CAC CGC CAA CAC ACC TCG GGG 

Asp 5 Ala Tyr Gin Asp Leu Tyr Gin Pro His Arg Gin His Thr Ser Gly 
1525 1530 1 

CTG CCT AGT GCT AAC GGG CAT GAC CCC CAA GAC CGG CTT CTA GCC CAG 

Leu^ro Ser Ala Asn Gly His Asp Pro Gin Asp Arg Leu Leu Ala Gin 
1540 1545 15S0 

GAC TCA GAA CAC AAC CAC AGT GAC CGG AAC TGG CAG CGC AAC CGG CCT 

As^Ser Glu His Asn His Ser Asp Arg Asn Trp Gin Arg Asn Arg Pro 
H 1560 1565 



1555 



TGG CCC AAG GAT AGC TAC TGACAGCCTC CTGCTGC 
1476 

Trp Pro Lys Asp Ser Tyr 
1570 1575 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Tyr Asp Asp Ser Tyr Val Pro "Gly Phe Glu Asp Ser Glu Ala Gly 

Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Leu Asp Ser Asp Val Ser 
20 25 JU 

Leu Glu Glu Asp Arg Glu Ser Ala Arg Arg Glu Val Glu Ser Gin Ala 
35 40 4S 

Gin Gin Gin Leu Glu Arg Ala Lys His Lys Pro Val Ala Phe Ala Val 
50 55 60 

Arg Thr Asn Val Ser Tyr Cys Gly Val Leu Asp Glu Glu Cys Pro Val 

70 '5 



65 



Gin Gly Ser Gly Val Asn Phe Glu Ala Lys Asp Phe Leu His lie Lys 
85 90 y3 

Glu Lys Tyr Ser Asn Asp Trp Trp He Gly Arg Leu Val Lys Glu Gly 
100 105 11U 

Gly Asp He Ala Phe He Pro Ser Pro Gin Arg Leu Glu Ser He Arg 
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Ser Ala Asn Gly His Asp Pro Gin Asp Arg Leu Leu Ala Gin Asp Ser 
450 455 460 

Glu His Asn His Ser Asp Arg Asn Trp Gin Arg Asn Arg Pro Trp Pro 
465 470 475 480 

Lys Asp Ser Tyr 
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What is claimed is: 

1. An isolated nucleic acid encoding a calcium 
channel alB subunit, wherein the nucleic acid has the 
nucleotide A at position 194 , or the nucleotide G at 

position 2559. 

2. The isolated nucleic acid of Claim 1, 
wherein the nucleic acid has the nucleotide A at 

position 6470. 

3. The isolated nucleic acid of Claim 1, 
wherein the nucleic acid has a deletion of nucleotides 
4814-4819. 

4. The isolated nucleic acid of Claim 1, 
wherein the nucleic acid is isolatable from human 
cerebellum tissue. 

5. A vector comprising the isolated nucleic 

acid of Claim 1. 

6. A host cell comprising the vector of 

Claim 5. 

1. An isolated nucleic acid encoding a calcium 
channel b3 subunit, wherein the nucleic acid has at 
least one of the following: a G at position 4 6; a T 
at position 203; a T at position 300; a G at position 
303; a G at position 420; a T at position 438; a C at 
position 477; a G at position 486; an A at position 
534; a C at position 552; a T at position 561; an A at 
position 1064; a GC as positions 1283-4; and a T at 

position 1308. 

8. An isolated nucleic acid encoding a calcium 
channel b3 subunit, wherein the nucleic acid has the 
nucleotides TCC inserted at position 119, or the 
nucleotides ATG insert d at position 978. 

9. A v ctor comprising the isolated nucleic 
acid of Claim 7 . 
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10. A host cell comprising the vector of 
Claim 9. 

11. The isolated nucleic acid of Claim 7, 
wherein the nucleic acid is isolatable from human 
cerebellum tissue. 

12- A protein encoded the nucleic acid of 

Claim 1. 

13. An antibody to the protein of Claim 12. 

14. A fusion protein comprising a protein 
encoded by the nucleic acid of Claim 1. 

15. A protein encoded the nucleic acid of 
Claim 7 . 

16. An antibody to the protein of Claim 15. 

17. A fusion protein comprising a protein 
encoded by the nucleic acid of Claim 7. 

18. An isolated human calcium channel, 
comprising subunits encoded by the nucleic acids 
having the sequences of SEQ ID NO. 1, SEQ ID NO. 3, 
and SEQ ID NO. 5. 

19. A method of identifying an agent which 
modulates the activity of a calcium channel, 
comprising the steps of: 

a. maintaining a host cell that expresses 
a functional calcium channel in a solution containing 
the agent to be tested and a calcium channel-selectiv 
ion, wherein the functional calcium channel comprises 
an alB subunit having the nucleotide A at position 
194, or the nucleotide G at position 2559; or a b3 
subunit having a G at position 46; a T at position 
203; a T at position 300; a G at position 303; a G at 
position 420; a T at position 438; a C at position 
477; a G at position 486; an A at position 534; a C at 
position 552; a T at position 561; an A at position 
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1064; a GC as positions 1283-4; of a T at position 
1308; 

b. depolarizing the cell membrane of the 

host cell; and 

c. detecting the current flowing into 
the cell, wherein if the current that is detected 
differs from that produced by a control cell 
maintained in a solution containing the calcium 
channel-selective ion, but not the agent to be tested 
then the agent is an agent which modulated calcium 
channel activity* 

20. An agent identified by the method of 

Claim 19. 
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Segment from ACHALPKA1B (1 to 7266) co be cranslaced: S2:710S 
Segment from ACHALPHA1B (1 to 7266) to be translated: 

10 20 30 40 SO 60 

TCCGTGGCTG CTCCGCTCTG AGCGCCTGGC GCGCCCCGCG CCCTCCCTGC CGGGGCCGCT GGGCCGGG 

80 90 100 109 H8 

TGCACGCGGG GCCCGGGAGC CAfGG^CGC^^p^CTG^CGGC^TAT 

MET val Arg Phe Gly Asp Glu Leu Gly Gly Arg iyr 

136 145 154 163 172 181 

^ggcc^g^^^^^^^^^1?3Sg?5SaSg1? 
Gly Gly Pro Gly Ser Gly Glu Arg Ala Arg Gly Gly Gly Ala Gly Gly Ala oiy 

190 199 208 217 226 235 

GGC CC^ GGT CCC GGG GGG CTG CAG CCC GGC CAG val Leu Tvr Lvs Gin Ser 

Gly Pro Gly Pro Gly Gly Leu Gin Pro Gly Gin Arg Val Leu Tyr Lys Gin ser 

244 2S3 262 271 280 ^ 289 

lie Ala Gin Arg Ala Arg Thr MET Ala Leu Tyr Asn Pro He Pro Val Lye Gin 
298 307 316 32S 334 343 

AAC TGC TTC ACC GTC AAC CGC TCG CTC TTC GTC TTC AGC a«H Asn Val Val 

Asn Cys Phe Thr Val Asn Arg Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val 

352 361 370 379 388 397 

CGC AAA TAG GCG AAG CGC ATC ACC GAG TGG CCT CCA TTC Glu" Tvr MET lie Leu 
Arg Lys Tyr Ala Lys Arg He Thr Glu Trp Pro Pro Phe Glu Tyr MET lie Leu 

406 415 424 433 442 451 

GCC ACC ATC ATC GCC AAC TG"C ATC GTG CTG GCC CTG rin His Leu Pro ASD 

Ala Thr He He Ala Asn Cys He Val Leu Ala L u Glu Gin His Leu Pro Asp 

460 469 478 487 496 505 

GGG GAC AAA ACG CCC ATG TCC GAG CGG CTG GAC GAC ACG GAG CCC TAT TTC ATC 

FIG. 
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Gly Asp Lys Thr Pro MET Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe He 
514 523 532 541 ^ 550 559 



568 



577 ' 586 595 604 613 

His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val MET Asp pne 

622 631 640 649 658 ^ 667 

Sg25E5555555S55S£2S 

676 685 694 703 712 721 

Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly lie .ro 

„. n . R 757 766 775 

730 739 748 '=>' 

_ 820 829 
784 793 802 811 

838 847 856 865 874 883 

GGC AAG TTC CAC AAG GCC TGT Asn Ser Thr Aso Ala Glu Pro Val Oly 

Gly Lys Phe His Lys Ala Cys Phe Pro Asn Ser Thr Asp Jua *»xu 

892 901 910 919 «• - ^ 

GAC TTC CCC TGT GGC AAG GAG G^C Ara Leu Cys Glu Gly Asp Thr Glu 

Asp Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu Cys oj.u i»j.y v 



946 



955 964 973 982 991 



TGC CGG GAG TAC TGG CCA GGA CCC AAC «iv ^le Thr Asn Phe Asp Asn lie 
Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly He Thr Asn rae v 

1000 1009 1018 1027 ^ 1036 1045 

CTG TTT GCC ATC TTG ACG GTG TTC CAG Jpp Thr MET Glu Gly Trp Thr Asp 

Leu Phe Ala He Leu Thr Val Phe Gin Cys He Thr MET Glu Gly irp 

1054 1063 1072 1081 1090 1099 

ATC CTC TAT AAT ACA AAC GAT GCG GCC rw Asn Thr Tro Asn Trp Le"u Tyr Phe 
He Leu Tyr Asn Thr Asn Asp Ala Ala Gly Asn Thr Trp Asn irp 

1108 1H7 1126 H35 1144 

ATC CCT CTC ATC ATC ATC GGC TCC TTC TTC ATG CTC AAC CTG GTG CTG GGC GTG 
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He Pro Leu He He II Gly Ser Phe Phe MET Leu Asn Leu Val Leu Gly Val 

1162 1171 H80 1189 ^1198 1207 

rTC TCG GGG GAG TTT GCC AAG GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC 
se? Phe Ala Lys Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe 

1216 1225 1234 1243 1252 ^1261 

1270 1279 1288 1297 • 1306 1315 

'Fgg ATC TTC AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG SAC AGG AAT GCA 5^ 
Se Ly2 Ala Glu Glu Val MET Leu Ala Glu Glu Asp Arg Asn Ala Glu 

132 4 1333 1342 1351 ^1360 1369 

Far 115 TCC C^T TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG AGC AGA AAT 
S2 tJS ler pS LeS Xp Val Leu Lys Arg Ala Ala Thr Lys Lys Ser Arg Asn 

1378 1387 1396 1405 1414 1423 

g55S5HHS5Kg255E5523SE!S 

143 2 1441 1450 1459 1468 1477 

i486 1«9S 1513 l ^ _ 

Tyr Phe Arg Arg Lys Glu Lys MET Phe Arg Phe Phe lie Arg 

1S 40 1549 1S58 1567 1576 _«8S 

X594 1603 1612 1621 163° _ _ 16 ^ 

GTG GCC ATG GTG CAT TAC AAC CAG CCG CGG CGG CTT ACC ACG ACC CTG TAT TTT 
Val Ala MET Val His Tyr Asn Gin Pro Arg Arg Leu Thr Thr Thr Leu Tyr roe 

1648 16S7 1666 1675 1684 ^ 

CPA GAG TTT GTT TTC CTG GGT CTC T^C CTC ACA GAG ATG TCC CTG AAG JAT 
S3 vS Phe SS Gly Leu Phe Leu Thr Glu MET Ser Leu Lys MET Tyr 

1702 1711 1720 1729 1738 1747 

Gly Leu Gly Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Pne exy 

--'- 17 56 1765 1774 1783 1792 1801 

GTC ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG GCC ATC AAG CCG GGA AGC 
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Val He Val Oly Ser Val Phe Glu Val Val Trp Ala Ala He Lys Pro Gly S r 
1810 1819 1828 1837 1846 1855 

,„«».» 1fl qi 1900 1909 
1864 1873 1882 1891 ±* 

1918 !927 X936 IMS «* «« 

1972 1561 19*0 19" *^ 

Leu Leu Gly MET Gin Leu Phe Gly Gly Gin Phe Asn 

oft-A 2053 2062 2071 
2026 203S 2044 2053 

— — — = ■=== -==p rjrr CTC ACT GTC TTC CAG ATC CTG 

ACA ACC AAC TTC GAC ACC TTC CCT GCC GCC ATC CTC AUI ^ 
Thr Thr Asn Phe Asp Thr Phe Pro Ala Ala Ile Leu to. 

2098 ' 2107 2116 2125 

Thr Gly Glu Asp Trp Asn Ala val MET Tyr His uiy 

Val Ser Lys Gly MET Phe Ser Ser Phe Tyr Pne ne 

2188 2197 2206 2215 2*» «33 

Asn Tyr Thr Leu Leu Asn Val Phe Leu Ala lie 

22*2 22S1 2260 2269 »™ *™ 

Ala Gin Glu Leu Thr Lys Asp Glu Glu Glu MET eiu ^j-u 

2296 2305 2314 2323 »»» »« 

Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu val ser 



2350 2359 



2368 2377 2386 2395 



AAC ATC TCC ATC GCC GCC AGG CAG CAG ^AC Ala Lys Ala Arg Ser Val Trp 

Asn lie Ser lie Ala Ala Arg Gin Gin Asn Ser Ala Lys Ala Arg 

2422 2431 2440 2449 

GAG CAG CGG GCC AGC CAG CTA CGG CTG CAG AAC CTG CGG GCC AGC TGC GAG GCG 
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Glu Gin Arg Ala Ser Gin Leu Arg Leu Gin Asn Leu Arg Ala Ser Cys Glu Ala 

2458 2467 2476 2485 2494 2503 

CTG TAC AGC GAG ATG GAC CCC GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC 
Leu Tyr Ser Glu MET Asp Pro Glu Glu Arg Leu Arg Phe Ala Thr Thr Arg Hi« 

2512 2521 2530 2539 2548 2557 

CGG CGG CCC GAC AT"G AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG-$p GGC 
Arg Arg Pro Asp MET Lys Thr His Leu Asp Arg Pro Leu Val Val Gltf Eeu Gly 

2566 2575 2584 2S93 2602 2611 

CGC GAC GGC GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG GAG 
"Arg Asp Gly Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala Glu 

2620 2629 2638 2647 2656 2665 

GCC CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC GAC AAG GAC 
Ala Pro Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg Asp Lys Asp 

2674 2683 2692 2701 2710 2719 

AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC CCG AAG GCG GAG AGC 
Lys Thr Pro Ala Ala Gly Asp Gin Asp Arg Ala Glu Ala Pro Lys Ala Glu Ser 

2728 2737 2746 2755 2764. *773 

GGG GAG CCC GGT GCC CGG GAG GAG CGG CCG CGG CCG CAC CGC AGC CAC AGC AAG' 
Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg Pro His Arg. Set- S*c Ser L;ys 

2782 2791 2800 2809 2818 2827 

GAG GCC GCG GGG CCC CCG GAG GCG CGG AGC GAG CGC GGC CGA GGC CCA GGC CCC 
Glu Ala Ala Gly Pro Pro Glu Ala Arg Ser Glu Arg Gly Arg Gly Pro Gly Pro 

2836 2845 2854 2863 2872 2881 

GAG GGC GGC CGG CGG CAC CAC CGG CGC GGC TCC CCG GAG GAG GCG GCC GAG CGG 
Glu Gly Gly Arg Arg His His Arg Arg Gly Ser Pro Glu Glu Ala Ala Glu Arg 

2890 2899 2908 2917 2926 2935 

GAG CCC CGA CGC CAC CGC GCG CAC CGG CAC CAG GAT CCG AGC AAG GAG TGC GCC 
Glu Pro Arg Arg His Arg Ala His Arg His Gin Asp Pro Ser Lys Glu Cys Ala 

2944 2953 2962 2971 2980 2989 

GGC GCC AAG GGC GAG CGG CGC GCG CGG CAC CGC GGC GGC CCC CGA GCG GGG CCC 

Oly Ala Lys Gly Glu Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro 
2996 3007 3016 3025 3034 3043 

CGG GAG GCG GAG AGC GGG GAG GAG CCG GCG CGG CGG CAC CGG GCC CGG CAC AAG 

Arg Glu Ala Glu Ser Gly Glu Glu Pro Ala Arg Arg His Arg Ala Arg His Lys 

3052 3061 3070 3079 3088 3097 

GCG* CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC ACG GAG AAG GAG GCC ACG 
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Ma Gin Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr olu Lys Glu Ala Thr 

3124 3133 3 ^__ 3 !!1 

3160 3169 3178 31B7 3196 3205 

3223 3232 3241 3250 3259 

-■SgSg553522SS5KKSSSSS 

3331 3340 3349 3358 _ 3367 

222S2S2SS23g5S5SSS 

3403 ^3412 3421 

= 7=?: 7?rr TFF rivA GCA GAG GGG AAG AAG GAG 

35 5 5 52 £J £5 ?S £5 £5 52 S «» ^ ». <*. 

3O0 3439 3446 3457 3«* »«™ 

Val Glu Ala Aso Aso Val MET Arg Ser Gly Pro Arg reo 

3502 «U »I20 .3525 

55555255555555535 SS55S 

3547 3556 3S« _3574 SMI 

Val Thr MET Arg Tyr Phe Glu Val Val He Leu vax 

He Ala Leu Ala Ala Glu Asp Pro Val Arg Thr Asp ser m s 

3673 3112 »«l 

Leu Lys Tyr Leu Ast) Tyr He Phe Thr Gly Val Phe Thr Pne oj.u » 

37 oo 3709 3 7ia 3727 3736 37« 

AAG ATG ATC GAC TTG GGA CTG CTG CTT CAC CCT GGA GCC TAT TTC CGG GAC TTG 
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Lys MET He Asp Leu Gly Leu Leu Leu His Pro Gly Ala Tyr Phe Arg Asp Leu 
3754 3763 3772 3781 3790 3799 

Sp Asn lie Leu Asp Phe lie Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe 

3808 3817 3826 3835 3844 3853 

t?S GGA TCC AAA GGG AAA GAC ATC AAT ACC ATC AAG TCT CTG AGA GTC.-CXT CGT 
sS ct? llr Lys Gly Lys Asp He Asn Thr lie Lys Ser Leu Axg Val Leu Arg 

3862 3871 3880 3889 3898 3907 

3916 3925 3934 3943 3952 3961 

GAC TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC ^3 Val Tvr MET 

Asp Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn He Leu lie Val Tyr MET 

3970 3979 3988 3997 4006 4015 

4024 4033 4042 4051 4M0 4069 

TTC TAC TGC ACA GAT GAA TCC AAG GAG CTG GAG AGG GAC TGC AGG G&T CAG JAj£ 
Phe Tyr Cys tS Sp SlS Ser Lys Glu Leu Glu Arg Asp Cys Arg Gly Gin Tyr 

4078 4087 4096 4105 4114 4123 

TTG GAT TAT GAG AAG GAG GAA GTG GAA GCT CAG CCC AGG «f!?f! lvs Tvr 

SS Asp Tyr Glu Lys Glu Glu Val Glu Ala Gin Pro Arg Gin Trp Lys Lys Tyr 

4132 4141 4150 , 41S9 4168 ^4177 

5Sc TTT CAC TAC GAC AAT GTG CTC TGG GCT CTG CTG ACG CTG TTC ACA GTG TCC 
Sp SE -25 xy? Sp £n Sal Leu Trp Ala Leu Leu Thr Leu Phe Thr Val Ser 

4186 4195 4204 4213 4222 4 231 

ACG GGA GAA GGC TGG CCC ATG GTG CTG AAA ^AC TCC jvbo Ala Thr Tyr Glu 

Thr Gly Glu Gly Trp Pro MET Val Leu Lys His Ser Val Asp Ala Thr Tyr aiu 

4240 4249 42S8 4267 4276 4 285 

GAG CAG GGT CCA AGC CCT GGG TAC CGC ATG GAG CTG TCC phe Tvr Val Val 

Glu Gin Gly Pro Ser Pro Gly Tyr Arg MET Glu Leu Ser lie Phe Tyr Val val 

4294 4303 4312 4321 4330 4339 

TAC TTT GTG GTC TTT CCC TTC TTC TTC GTC AAC ph3 val Ala Leu lie lie 

Tyr Phe Val Val Phe Pro Phe Phe Phe Val Asn He Phe Val Ala Leu lie lie 

4348 43S7 4366 4375 4384 4393 

ATC 3£CC TTC CAG GAG CAG GGG GAC AAG GTG ATG" TCT GAA TGC AGC CTG GAG AAG 
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, 1( Thr Ph e G!n Glu 01. Oly *«, Lys Val MET ser Glu cy, Ser Leu Glu Ly* 

4429 _ 4 !!1 4 !*I 

EKS5S5S5SH525S5S5SSS3S 

4465 4474 4483 4 *^ 

4546 4555 

, C1Q 4528 4537 
4510 4519 

..em 4600 4€0S 
4582 4591 tow 
4564 4573 "° 

MET MET Lys Phe Tyr Asp Ala Pro Tyx xy 

464S 4654 466 
4618 4627 4fi3S 

4690 4699 4708 ^ 



* - _ — — ji 

4726 4 ?3S 4744 4753 47 

a o i c - 4 825 



47B0 — — 

4888 4897 4906 4915 *"* 

«•« i9 Z t9 z 

Gly MET Oln Val Phe Oly Aen lie Ala Leu Asp «sp 

* - 5032 S041 

—•4996 5005 5014 5023 

CAC AAC AAC TTC CGG ACG TTT^ TTG CAA GCC CTG ATG CTG CTG TTC AGG AGC GCC 



4769 "98 * S07 
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His Asn Asn Phe Arg Thr Phe Leu Gin J&a^eu MET Leu Leu Phe Arg Ser Ala 

50S0 5059 5068 5077 5086 5095 

ACG GGG GAG GCC TGG CAC GAG ATC" ATG CTG TCC TGC CTG AGC AAC 

Thr Sly Glu Ala Trp His Glu lie MET Leu Ser Cys Leu Ser Asn Gin Ala cys 

5104 5113 5122 5131 5140 5149 

GAT C^AG CAG GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TTC^C TTC 

Asp Glu Gin Ala Asn Ala Thr Glu Cys Gly Ser Asp Phe Ala Tyr Phe Tyr Phe 

5158 5167 5176 5185 5194 5203 

GTC TCC TTC ATC TTC CTG TGC TCC TTT CTG ATG TTG AAC CTC TTT GTG GCT 
"Val Ser Phe He Phe Leu Cys Ser Phe Leu MET Leu Asn Leu Phe Val Ala Val 

5212 5221 5230 5239 5248 52S7 

ATC ATG GAC AAT TTT GAG TAC CTC ACG CGG GAC TCT TCC ATC CTA GGT CCT CAC 
He MET Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser He Leu Gly Pro His 

S266 5275 5284 5293 5302 5311 

CAC TTG GAT GAG TTC A^fC CGG GTC TGG GCT GAA TAC GAC CCG G^T GCG TGT GGG 
His Leu Asp Glu Phe He Arg Val Trp Ala Glu Tyr Asp Pro Ala Ala Cys Gly 

5320 5329 5338 S347 5356 5365 

CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG AAA CAC ATG TCC CCG CCT 
Arg He Ser Tyr Asn Asp MET Phe Glu MET Leu Lys His MET Ser Pro Pro Leu 

5374 5383 5392 5401 5410 5419 

GGG CTG GGG AAG AAA TGC CCT GCT CGA GTT GCT TAC AAG CGC CTG GTT ^GC 
Gly Leu Gly Lys Lys Cys Pro Ala Arg Val Ala Tyr Lys Arg Leu Val Arg MET 

5428 5437 5446 5455 5464 5473 

AAC CCC ATC TCC AAC GAG GAC ATG ACT" GTT ^AC TTC ^£ ^EH thH Leu MET 

Asn MET Pro He Ser Asn Glu Asp MET Thr Val His Phe Thr Ser Thr Leu MET 

5482 S491 5500 5509 5518 5527 

GCC CTC ATC CGG ACG GCA CTG GAG ATC AAG CTG GCC CCA GCT GGG ACA AAG CAG 
Ala Leu He Arg Thr Ala Leu Glu He Lys Leu Ala Pro Ala Gly Thr Lye Gin 

5536 5545 5554 5563 5572 5581 

CAT. CAG TGT GAC GCG GAG TTG AGG AAG GAG ATT TCC GTT GTG TGG GCC AAT CTG 
His Gin Cys Asp Ala Glu Leu Arg Lys Glu He Ser Val Val Trp Ala Asn Leu 

5590 5599 5608 5617 5626 5635 

CCC CAG AAG ACT TTG GAC TTG CTG GTA CCA CCC CAT AAG CCT GAT GAG ATG ACA 
Pro Gin Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu MET Thr 

5644 5653 5662 5671 5680 5689 

GTG G^G AAG GTT TAT GCA GCT CTG ATG ATA TTT GAC TTC TAC. AAG CAG AAC AAA 
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Val Gly Lys Val Tyr Ala Ala Leu MET lie Phe Asp Phe Tyr Lys Gin Asn Lys 
5698 5707 5716 5725 5734 5743 

c-7-tq 5788 5797 
5752 5761 5770 5779 = 

S222252iSSS2S25255B 

, , „ cni-i 5842 5851 

5806 5815 5824 5B33 

25 25522 2; 5? 52 25 2525255555.552 

eafQ «?R-7fi 5887 5896 5905 
5860 5869 5B78 

s914 5923 5932 59" 

«b™»7?25222E3SS£SS" 

Thr Gin Arg Thr Gin Asp Ala Pro His Giu Axa. «rg 

^«as 5995 6004 6013 
5968 5977 5986 

25255HS5552555555555.-.SS2 

60 22 6031 "40 6049 6058 «« 

MET Gin Ser lie Thr Arg Arg Gly Pro Asp Gly Glu fro 

60 76 6085 6094 . 6103 6112 .^121 

"aGC CAG GGT CGA GCG GCC TCC jj^j^ pro Arq Leu Ala Ala Glu Thr Gin Pro V&l 
Ser Gin Sly Arg Ala Al» Ser MET Pro Arg l*u Ala 

61 30 6139 6148 6157 6166 ««* 

Thr Aep Ale Ser pro MET Lye Arg Ser lie ser Tnr i*u *u 

S18 . 6193 6202 6211 6220 6229 

w » m « w « »= » g 25 52 S-5 £ 32 K •? 

Gly Thr His Leu Cys Ser Thr Thr Pro Asp Arg Pro pro rxo o 

6 238 6247 6256 "65 6274 _«83 

Ser His His His His His Arg Cys His Arg Arg Arg Asp Arg Lys ^ 

..-6292 6301 6310 6319 6328 "37 

CTG SS5 AAG GGG CCC AGC CTG TCT GCC GAT ATG GAT GGC GCA CCA AGC AGT GCT 
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oi» ,y. Gly Pro S« Leu Ser Ala Asp MET Asp Sly Ala Pro Ser Ser Ale 

"Z "Z 

*«ib 6427 6436 6445 
6400 6409 6418 642/ 

6454 6463 6 2! 

,,,, 6535 6544 6SS3 
6508 6517 6526 653b 

«»•» 6 f!l "Z 

««« s lf! 6 ffi 

"Z "Z 

A^^^AH^^|^^^«c^ty S TSSaA^IerIS^ 

Thr Pro Leu Thr Pro Arg Pro Ser lie Tax iyr 

cm 6742 : 6751 6760 6769 
6724 6733 » 

fSS^SlfyASlSS^rSiSSS.IS^Sy^Su 

•™ " os *^ "Z 

«.» 6841 6850 "59 «« 

^^^^GScTH^«T55=|=T^|S^2SSy^S™ 
Pro Leu Ala Pro Gly Ser Arg lie Gly Ser Asp rro *-y 

6.35 6904 6913 •»» S ^ ' 

r.: 6940 6 » 9 6556 «m "Z 

gS^^A^A^f^^^T^T^A^ACfTCCTACGTOTCCTCCCIG 
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* _ Thr Ser Tyr Val Ser Ser I*eu 
^ V.! M. T ta »- S« «, Se, Ser A« « W 

69 94 7003 7012 7021 

- — = TTr GGT TAC CAC TGC ACC CTG 

ACC TCC CAG TCT CAC CCT CTC CGC CGC GTG CCC AAC G ^ ^ ^ Thr Jj«u 
5*r Ser Gin Ser His Pro Leu Arg *rg 

7048 7057 7066 7075 



7145 71S5 

7102 



7115 7125 7135 



> *„„™r.»r.i rccCTGCATG CAGCAGGCGT GTGTTCCAGT 

CAC TGG TGC TAG CTGCACCGTG ACCGCTCAGA CG CCTGCATG 

K±s Trp Cys 72Q? 7215 72 



7165 7175 _ rrtCAGGCCTT GCCCACCTTG GTGAGGCT' 

GGATGAGTTT TATCATCCAC ACGGGGCAGT CGGCCCTCGG GGGAGGCCTT 

7235 724S 7255 7265 

TGTGGCCCCT CCCTCCCCCT CCTCCCCTCT TTTACTCTAG A 

RANSLATE [Partial] : q 
EQ: rec 

nd recording to seq.rec? (<CR>-Yes> 

nding recording for user SHUEYD at 7-Mar-95 3:22pm 



7185 7195 
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utilizes che universal genetic code unless modified with the 
EDIT - CODONS command . 



ACHALPHA2 



Frame -1 
Frame -2 
Frame -3 



745:774 

20:3295 
201:257 
588 :662 
1362:1427 
1701:1856 
2574 :2630 



1354 : 1440 

378:455 
726:800 
1491:1520 
2106:2171 
2656:2756 



1786:1938 

474 :560 
1062 :1277 
1545:1634 
2193 :2288 
2868 :2966 



561:587 
1302:1328 
1665:1697 
2292:2366 
3153 :3215 



Would you like to see the display of open reading frames? (<CR>-No) 



ACHALPHA2' 

Frame -4 : 
Frame -5 : 

Frame -6 : 



- Numbering increases in 5' 



to 3' direction 



304:336 
62:115 
2519:2575 

240:272 
1227:1259 
1662:1730 
2976 :2999 



568:612 
179:307 
2765:2797 

348:392 
1338 :1502 
1920:1958 



724:831 
392:616 
2831:2908 

549:635 
1512:1562 
2400:2474 



2779:3177 
1874 :2011 
3224 :END 
1041:1097 
1575 :1628 
2532:2702 



Would you like to see the display of open reading frames? <.<CR>-No> 



Segment from ACHALPHA2 (1 to 3298) to be translated: 20 3295 
Segment from ACHALPHA2 (1 to 3298) to be translated: 



10 



19 



28 



37 



46 



55 



TTGATCTTCG ATCGCGAAG ATG GCT GCT GGC TGC CTG CTG GCC TTG ACT CTG ACA 

MET Ala Ala Gly Cys Leu Leu Ala Leu Thr Leu Tnr 



64 



73 



82 



91 



100 



109. 



Ul 127 136 14S 154 1« 

TIG. 2 
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val T*x II. W «r Trp val up Lys MET Gin Glu Asp Leu V.1 Thx Leu Ala 
17J 181 "0 I" ^ 

* ™ !2. 

£eu Tyr Thr Val Glu Pro Asn Asn Ala Arg Gin Leu 

, ao 307 316 325 
280 289 298 307 

— — -tt^t ==5? TaS GCC CTG GTG AGC CTG GCA TTG 

GAT ATT GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC u Ser M Ala Leu 

Sine Glu Lys Leu Leu Ser Asn Arg Ser Lye AJ-a us 

«, 361 370 379 
334 343 352 361 

a** 415 424 433 
388 397 406 415 

469 478 487 
442 451 460 469 ^ 



496 505 



e*5i 532 541 

514 523 



550 559 



ctt 586 • 595 

568 S77 s " 



GGC TCA ACA ATT GTG TTA AAT GAA CTC AAC TGG ACA ACT GCC TTA GAT GAA GTT 

Gly Ser Thr He Val Leu Asn Glu Leu Asn Trp tut » 

631 "0 649 
604 613 622 631 

f!i !!! 

Ala Thr Gly Leu Ala Arg Tyr Tyr Pro Ala Ser Pro 

730 739 748 757 

ACT CCA AAT AAG ATT GAC CTT am Val ATO jSg Arg Pro Trp Tyr lie Gin 

Thr Pro Asn Lys He Asp Leu Tyr Asp Val Arg Arg 

784 793 802 811 
766 775 7a * 

r=s TTT GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA AGT GTT 

GGA GCT GCA TCT CCT AAA GAC ATG CTT au 
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Gly Ala Ala Ser Pro Lys Asp MET Leu lie Leu Val Asp Val Ser Gly Ser Val 
820 829 838 847 856 865 

AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA ATC TOA GAA ACC 
Ser Gly Leu Thr Leu Lys Leu lie Arg Thr Ser Val Ser Glu MET Leu Glu Thr 

874 883 892 901 910 919 

CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT AAC AGC AAT GCT CAG GAT 
Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe Asn Ser Asn Ala Qln Asp 

928 937 946 955 964 973 

GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA AAT GTA AGA AAT AAA AAA GTG TO 
Val Ser Cys Phe Gin His Leu Val Gin Ala Asn -Val Arg Asn Lys Lys Val Leu 

982 991 1000 1009 101B 1027 

m^^^^nf ^J^^nAGGAWf 

Lys Asp Ala Val Asn Asn He Thr Ala Lys Gly He Thr Asp Tyr Lys Lys Gly 

1036 1045 10S4 1063 1072 1081 

TTT AGT mOTm^^OTm^m^OTTCCAaGCAJW^ 
Phe Ser Phe Ala Phe Glu Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys 

1X17 1126 1135 



1090 1099 1108 



AAT AAG ATT ATT ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA 
Asn Lys lie lie MET Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu lie 

1144 1153 1162 1171 H80 1189 

TTT" AAC AAA TAC AAT AAA GAT AAA AAA GTA CGT j^"^ ^ Phe Ser Val Gly 

Phe Asn Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 



1198 1207 1216 



1225 1234 1243 



Gin His Asn Tyr Glu Arg Gly Pro He Gin Trp MET Ala Cys Glu Asn Lys Gly 
1252 1261 1270 1279 12*88 1297 

TAT TAT TAT GAA ATT CCT TCC ATT" GGT GCA ATA ~^GA ThE Gin Glu Tvr 

Tyr Tyr Tyr Glu He Pro Ser He Gly Ala He Arg He Asn Thr Gin Glu Tyr 

1306 1315 1324 1333 1342 1351 

TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA Ff5!j Val 

Leu Asp Val Leu Gly Arg Pro MET Val Leu Ala Gly Asp Lys Ala Lys Gin Vel 

1360 1369 1378 1387 1396 MPS- 

CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG CTG GGA v^i Tie Thr G^Y 

Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu Glu Leu Gly Leu Val He Thr Gly. 

.... 1414 142 3 1432 1441 1450 1459 

ACT*" CTT CCG GTC TTC AAC ATA ACC GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG 
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», ^ Pro V.1 Ph. A» He Thr Gly 01= Ph. Glu A.» Ly. Thr «= L.U Ly. 

M « 1«77 »« "»» ^ *™ 

is2= 1540 1549 

• — - == -rsr T=fr TEC cEE 55 555 fE TAC TTT GCA ATC GAT 

1S76 15B5 1594 1603 I*" «» 

™__Z?L 

i z "Z 

17«_^_l™ 17" 

1..1 ' I." ^ 

, ae . , a73 1882 1891 
1846 1855 1864 1873 

"Z 

ser Lys Lye Gly Lya MET Lye Asp Ser oxu » 

Olu ser Gly Tyr Thr Phe He Ala Pro Arg Asp xyr 

2035 2044 20S3. 
2008 2017 2026 2U3& 

__— - ==r rrs =sp TTF GAG TTT ATT GAT AGA AAA 

12 IS 5S S Si ™ S 2 2 2 » n «» »>. n. «p «, «. 

20S2 2071 20.0 2089 _J096 2107 

ACT*" CCA AAC AAC CCA TCA TGT AAC GCG* GAT TTG ATT" AAT AGA GTC TTG CTT GAT 
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Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp Leu He Asn Arg val Leu Leu Asp 
2116 2125 2134 2143 2152 2161 

GCA GGC TTT ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT AXC 
Ala Gly Phe Thr Asn Glu Leu Val Gla Asn Tyr Trp Ser Lys Gin Lys Asn 11 

2170 2179 2188 2197 2206 2215 

AAG GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA GTT TAT 
££s Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly Gly lie Thr Arg Val Tyr 

2224 2233 2242 2251 2260 2269 

CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA 3^T GAG GAC 
So Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr Tyr Glu Asp Ser 

2278 2287 2296 230S 2314 2323 

TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT" TTC ACT GCT CCC TAC 
Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe 

2332 2341 2350 2359 2368 2377 

AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG GGC ATT ATG GTA AGC AAA GCT CTA 
Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly He MET Val Ser Lys Ala Val 

2386 2395 2404 2413 2422 2431 

Glu He Tyr He Gin Gly Lys Leu Leu Lys Pro Ala Val Val Gly lie Lys ixe 
2440 2449 2458 2467 2476 2485 

GAT GTA AAT TCC TGG ATA GAG AAT T"TC ^CC AAA ACC ^rle Ara ASD Pro CVB 

Asp Val Asn Ser Trp He Glu Asn Phe Thr Lys Thr Ser He Arg Asp Pro cys 

2494 2503 2512 _ 2521 2530 . 2539 

GCT GGT CCA GTT TGT GAC TGC AAA AGA 5AC AGT ^AC ^A ntet Asn Cvs Val Xle 
Ala Gly Pro Val Cys Asp Cys Lys Arg Asn Ser Asp Val MET Asp Cys Val lie 

2548 2557 2566 2575 2584 2S93 

CTG GAT GAT* GGT GGG TTT CTT" CTG ATG GCA AAT CAT GAT GAT TA^" jjjj^ 

Leu Asp Asp Gly Gly Phe Leu Leu MET Ala Asn His Asp Asp Tyr Thr Asn Gin 

2602 2611 2620 2629 2638 2647 

ATT GGA AGA TTT* TT*T GGA GAG ATT" GAT CCC AGC TTG ATG AGA CAC CTG j2n 
He Gly Arg Phe Phe Gly Glu He Asp Pro Ser Leu MET Arg His Leu Val A*n 

2656 2665 2674 2683 2692 2701 

ATAT^OTTjttGCTmAACATETCTT^ 

He Ser Val Tyr Ala Ph Asn Lys Ser Tyr Asp Tyr Gin S r Val Cys Glu Pro 

2710 2719 2728 2737 2746 27S5 

Gg¥"gCT C^CS]^"^^^^^^^ 555 ™ 555 CCA TCA GTA 
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Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser Ala Tyr Val Pro Ser Val 
2764 2773 2782 2791 2800 2809 

555222555555555222 

2818 2827 2836 2845 _ 2 ™ ^ 

5222222222252SS5SB 

28 72 2881 2890 2899 2908 2917 

- — — === t= Tvv 5 Trr ctg TCC AAG CAG AGC TGC ATT ACT GAA CAA 

«« oys n« <*» 

2 „6 2935 2844 "S3 ^JK2 ^ ^2971 

Thr Gin Tyr Phe Phe Asp Asn Asp Ser Lys ser fcrie a« y 

OQQfl 3007 3016 3025 
2980 2989 2998 juu/ 

255255552255555555 

3034 3043 3052 3061 ^JOTO 3079 

252555222222SSSaS2 

3088 3097 3106 311S 3124 3133 

3142 3151 3160 * 3169^ 3178 .3187 

319 6 3205 3214 3223 3233 3241 

Thr Asp cya Gly Gly Val Ser Gly Leu Aan Pro Ser Leu txp 

„„ , 2fi8 3277 3286 329S 
3250 3259 3268 ' ' _ > 

He Gin Phe Leu Leu Leu Trp Leu Val Ser Gly Ser Tnr »xs 



CCT 
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n«fflETA3 (1 to 1476) to be translated: 
Segment from DSHBETA3 n ^ 5S 

19 28 37 

10 —z tz H£5 q]£c TCG GAG GCG GGT 

«** & ^2SS^?!-SSSS «» ~* - «. ^ *r 



73 82 



V3 w *• ^^^^ 

64 ^ -j= TCA GAC GTC TCC CTG GAG 

Ser Ala Asp Ser iyr *» 154 1« 

i»e l* s 



12 7 136 

118 . — — -r« "grg GCT CAG CAG CAG CXC 



* 72 —= TFF AAT GTC AGC TAC 

Glu Arg Ala Lys His Lys Pro a ^ 271 



190 1" 



235 



226 ^ H=C TCT GGA GTC AAC TTT GAG 



2a9 298 307 

2 80 289 « -TTr gac TGG TGG ATC GGG 

Ala Lys Asp Phe Leu His xxe u Y ^ 379 

« V M JDl 



334 343 HI — t?F CCC AGC CCC CAG CGC CTG 

""■■SPSSSSS S SS 35 S «. cm - 

Arg Leu Val Lys Glu Gly eiy asp _ ^ 433 



352 361 



406 415 



MSsasseasssassBM 

4S1 460 469 

Ser Leu Ser Asp He Gly Asn ^9 » ^ s41 

5X4 523 

495 505 tcc WG CGO 

jbwm-wSSSSSSS JESS'S SS~«« 

Lys Gin Lys Gin Ala Glu His vax w 
Y • SS9 56 e 577 58 

•Pro Val Val Leu Val Gly Pro »e 

622 631 640 



604 613 

FIG. 3 
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— „ ~~r* mr A.GA. TTT GAT GGC AGG ATC TCC ATC 

Si SS 55 S 25 £2 £s £ « Gly ~g n. s.r n 



658 «67 



676 685 «94 703 



ACC CGA GTC ACA GCC GAC ser Leu • Ala Lys* Arg Ser Val Leu Asa Asn Pro 
Thr Arg val Thr Ala Asp Leu Ser Leu Ala Lys 



Tin 739 748 _ 757 
712 721 730 ^ 73 

„. 703 802 811 
766 775 784 793 

— 7T 715= GCC* AAA TCC CTG CAG CTA GTA GTG 

s s is s as si s i£ a; is 5? <»° — . vrt 

***** J ^^" 056 865 

355555255523555325 

•» !!! 

«5S5352S522i!::a2a52S 

« 6 HI 

Arg Ser Arg Gly Lys Ser Sin MET Lye Hie Leu tbi 

»°» l « ^ 

Asp Lys Leu Val Gin Cys Pro Pro Glu Ser Pne « P 

™ — 

Oln Leu Glu Asp Ala Cys Glu His Leu Ala oau 

10 90 1099 110.. "» l ^ __"!!! 

Ale Thr Hie Hie Pro Ale fro Gly Pro Gly Leu Leu wj 

aiM n« . _ _ 

Pro Gly Leu Gin Aen Gin Gin Leu Leu Gly Glu Arg uiy 

"?» ^ 

CTT GAG CGG Sac agc jj^® pro Ser Asp Glu Ale Ser Glu Ser Ser Arg Gin 
Leu -Glu Arg- Asp Ser Leu MET Pro ser asp 

1.279 1288 1297 
X2S2 1261 1270 
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_ r-r-r- nc CTG GAG GAG GA.C TAT GCA 

s sssssssssssss s si ss s - - -p ^ 



m si is si ss 25 a ss s si 5SS 

l414 1423 I* 32 1441 ' _ 14 H - 

ss-sassaKSSSsissssss 

1472 

TGA CAGCCTCCTG CTGC 
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